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Preface 


You  are  probably  about  to  teach  or  take  a  “first  course  in  proof  techniques,”  or 
maybe  you  just  want  to  learn  more  about  mathematics.  No  matter  what  the  reason,  a 
student  who  wishes  to  learn  the  material  in  this  book  likes  mathematics  and  we  hope 
to  keep  it  that  way.  At  this  point,  students  have  an  intuitive  sense  of  why  things  are 
true,  but  not  the  exposure  to  the  detailed  and  critical  thinking  necessary  to  survive 
in  the  mathematical  world.  We  have  written  this  book  to  bridge  this  gap. 

In  our  experience,  students  beginning  this  course  have  little  training  in  rigorous 
mathematical  reasoning;  they  need  guidance.  At  the  end,  they  are  where  they  should 
be;  on  their  own.  Our  aim  is  to  teach  the  students  to  read,  write,  and  do  mathematics 
independently,  and  to  do  it  with  clarity,  precision,  and  care.  If  we  can  maintain  the 
enthusiasm  they  have  for  the  subject,  or  even  create  some  along  the  way,  our  book 
has  done  what  it  was  intended  to  do. 

Reading.  This  book  was  written  for  a  course  we  teach  to  first-  and  second-year 
college  students.  The  style  is  informal.  A  few  problems  require  calculus,  but  these 
are  identified  as  such.  Students  will  also  need  to  participate  while  reading  proofs, 
prodded  by  questions  (such  as,  “Why?”).  Many  detailed  examples  are  provided  in 
each  chapter.  Since  we  encourage  the  students  to  draw  pictures,  we  include  many  il¬ 
lustrations  as  well.  Exercises,  designed  to  teach  certain  concepts,  are  also  included. 
These  can  be  used  as  a  basis  for  class  discussion,  or  preparation  for  the  class.  Stu¬ 
dents  are  expected  to  solve  the  exercises  before  moving  on  to  the  problems.  Com¬ 
plete  solutions  to  all  of  the  exercises  are  provided  at  the  end  of  each  chapter.  Prob¬ 
lems  of  varying  degrees  of  difficulty  appear  at  the  end  of  each  chapter.  Some  prob¬ 
lems  are  simply  proofs  of  theorems  that  students  are  asked  to  read  and  summarize; 
others  supply  details  to  statements  in  the  text.  Though  many  of  the  remaining  prob¬ 
lems  are  standard,  we  hope  that  students  will  solve  some  of  the  unique  problems 
presented  in  each  chapter. 

Writing.  The  bad  news  is  that  it  is  not  easy  to  write  a  proof  well.  The  good  news 
is  that  with  proper  instruction,  students  quickly  learn  the  basics  of  writing.  We  try 
to  write  in  a  way  that  we  hope  is  worthy  of  imitation,  but  we  also  provide  students 
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with  “tips”  on  writing,  ranging  from  the  (what  should  be)  obvious  to  the  insider’s 
preference  (“Don’t  start  a  sentence  with  a  symbol.”). 

Proving.  How  can  someone  learn  to  prove  mathematical  results?  There  are  many 
theories  on  this.  We  believe  that  learning  mathematics  is  the  same  as  learning  to  play 
an  instrument  or  learning  to  succeed  at  a  particular  sport.  Someone  must  provide  the 
background:  the  tips,  information  on  the  basic  skills,  and  the  insider’s  “know-how.” 
Then  the  student  has  to  practice.  Musicians  and  athletes  practice  hours  a  day,  and 
it’s  not  surprising  that  most  mathematicians  do,  too.  We  will  provide  students  with 
the  background;  the  exercises  and  problems  are  there  for  practice.  The  instructor 
observes,  guides,  teaches  and,  if  need  be,  corrects.  As  with  anything  else,  the  more 
a  student  practices,  the  better  she  or  he  will  become  at  solving  problems. 

Using  this  book.  What  should  be  in  a  book  like  this  one?  Even  a  quick  glance  at 
other  texts  on  this  subject  will  tell  you  that  everyone  agrees  on  certain  topics:  logic, 
quantifiers,  basic  set  theoretic  concepts,  mathematical  induction,  and  the  definition 
and  properties  of  functions.  The  depth  of  coverage  is  open  to  debate,  of  course.  We 
try  to  cover  logic  and  quantifiers  fairly  quickly,  because  we  believe  that  students 
can  only  fully  appreciate  the  fundamentals  of  mathematics  when  they  are  applied  to 
interesting  problems. 

What  is  also  apparent  is  that  after  these  essential  concepts,  everyone  disagrees 
on  what  should  be  included.  Even  we  prefer  to  vary  our  approach  depending  on  our 
students.  We  have  tried  to  provide  enough  material  for  a  flexible  approach. 

•  The  Minimal  Approach.  If  you  need  only  the  basics,  cover  Chapters  1-18.  (If  you 
assume  the  well  ordering  principle,  or  decide  to  accept  the  principle  of  mathe¬ 
matical  induction  without  proof,  you  can  also  omit  Chapters  12  and  13.) 

•  The  Usual  Approach.  This  approach  includes  Chapters  1-18  and  Chapters  21 — 
24.  (This  is  easily  doable  in  a  standard  semester,  if  the  class  meets  three  hours 
per  week.) 

•  The  Algebra  Approach.  For  an  algebraic  slant  to  the  course,  cover  Chapters  1-18, 
omitting  Chapter  13  and  including  Chapters  27  and  28. 

•  The  Analysis  Approach.  For  a  slant  toward  analysis,  cover  Chapters  1-23.  (In¬ 
clude  Chapter  24,  if  time  allows.  This  is  what  we  usually  cover  in  our  course.) 
Include  as  much  material  from  Chapters  25  and  26  as  time  allows.  Students  usu¬ 
ally  enjoy  an  introduction  to  metric  spaces. 

•  Projects.  We  have  included  projects  intended  to  let  students  demonstrate  what 
they  can  do  when  they  are  on  their  own.  We  indicate  prerequisites  for  each 
project,  and  have  tried  to  vary  them  enough  that  they  can  be  assigned  throughout 
the  semester.  The  results  in  these  projects  come  from  different  areas  that  we  find 
particularly  interesting.  Students  can  be  guided  to  a  project  at  their  level.  Since 
there  are  open-ended  parts  in  each  project,  students  can  take  these  projects  as  far 
as  they  want.  We  usually  encourage  the  students  to  work  on  these  in  groups. 

•  Notation.  A  word  about  some  of  our  symbols  is  in  order  here.  In  an  attempt  to 
make  this  book  user-friendly,  we  indicate  the  end  of  a  proof  with  the  well-known 
symbol  □.  The  end  of  an  example  or  exercise  is  designated  by  O-  If  a  problem  is 
used  later  in  the  text,  we  designate  it  by  Problem#.  We  also  have  a  fair  number 
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of  “nonproofs.”  These  are  “proofs”  with  errors,  gaps,  or  both;  the  students  are 
asked  to  find  the  flaw  and  to  fix  it.  We  conclude  such  “proofs”  with  the  symbol 
0 .  Every  other  symbol  will  be  defined  when  we  introduce  you  to  it.  Definitions 
are  incorporated  in  the  text  for  ease  of  reading  and  the  terms  defined  are  given  in 
boldface  type. 

Presenting.  We  also  hope  that  students  will  make  the  transition  to  thinking  of 
themselves  as  members  of  a  mathematical  community.  We  encourage  the  students 
we  have  in  this  class  to  attend  talks,  give  talks,  go  to  conferences,  read  mathemat¬ 
ical  books,  watch  mathematical  movies,  read  journal  articles,  and  talk  with  their 
colleagues  about  the  things  in  this  course  that  interest  them.  Our  (incomplete,  but 
lengthy)  list  of  references  should  serve  a  student  well  as  a  starting  point.  Each  of  the 
projects  works  well  as  the  basis  of  a  talk  for  students,  and  we  have  included  some 
background  material  in  each  section.  We  begin  the  chapter  on  projects  with  some 
tips  on  speaking  about  mathematics. 

What’s  new  in  this  edition.  We  have  made  many  changes  to  the  first  edition.  First, 
all  exercises  now  have  solutions  and  every  chapter,  except  for  the  first,  has  at  least 
twenty  problems  of  varying  difficulty.  As  a  result,  the  text  has  now  roughly  twice 
as  many  problems  than  before.  As  in  the  first  edition,  definitions  are  incorporated  in 
the  text.  In  this  edition,  all  definitions  newly  introduced  in  a  chapter  appear  again  in 
a  section  with  formal  statements  of  the  new  definitions.  We  have  included  a  detailed 
description  of  definitions  by  recursion  and  a  recursion  theorem.  We’ve  added  axioms 
of  set  theory  to  the  appendix.  We  have  included  new  projects:  one  on  the  axiom  of 
choice  and  one  on  complex  numbers.  We  have  added  some  interesting  pieces  to  two 
projects.  Picture  Proofs  and  The  Best  Number  of  All  (and  Some  Other  Pretty  Good 
Ones). 

Some  chapters  have  been  changed  or  added.  The  first  edition’s  Chapter  12,  which 
required  more  of  students  than  previous  chapters,  has  been  broken  into  two  chapters, 
now  enumerated  Chapters  12  and  13.  If  the  instructor  wishes,  it  is  possible  to  simply 
assume  the  results  in  Chapter  13  and  omit  the  chapter.  We  have  also  included  a  new 
chapter.  Chapter  24,  on  the  Cantor-Schroder-Bernstein  theorem.  We  feel  that  this 
is  the  proper  culmination  to  Chapters  21-23  and  a  wonderful  way  to  end  the  course, 
but  be  forewarned  that  it  is  not  an  easy  chapter. 

Thanks  to  many  of  you  who  used  the  text,  we  were  able  to  pinpoint  areas  where 
we  could  improve  many  of  our  explanations,  provide  more  motivation,  or  present  a 
different  perspective.  Our  goal  was  to  find  simpler,  more  precise  explanations,  and 
we  hope  that  we  have  been  successful.  One  new  feature  of  this  text  that  may  interest 
instructors  of  the  course;  We  have  written  solutions  to  every  third  problem.  These 
are  available  on  our  website  (see  below). 

Of  course,  we  have  updated  our  reference  list,  made  corrections  to  errors  that 
appeared  in  the  first  version,  and,  most  likely,  introduced  new  errors  in  the  second 
version.  We  hope  you  will  send  us  corrections  to  errors  that  you  find  in  the  text,  as 
well  as  any  suggestions  you  have  for  improvement. 

We  hope  that  through  reading,  writing,  proving,  and  presenting  mathematics,  we 
can  produce  students  who  will  make  good  colleagues  in  every  sense  of  the  word. 
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Chapter  1 

The  How,  When,  and  Why  of  Mathematics 


What  is  mathematics?  Many  people  think  of  mathematics  (incorrectly)  as  addition, 
subtraction,  multiplication,  and  division  of  numbers.  Those  with  more  mathematical 
training  may  think  of  it  as  dealing  with  algorithms.  But  most  professional  mathe¬ 
maticians  think  of  it  as  much  more  than  that.  While  we  certainly  hope  that  our 
students  will  perform  algorithms  correctly,  what  we  really  want  is  for  them  to  un¬ 
derstand  three  things:  how  you  do  something,  why  it  works,  and  when  it  works. 
The  problems  we  present  to  you  in  this  book  concentrate  on  these  three  goals.  If 
this  is  the  first  time  you  have  been  asked  to  prove  theorems,  you  may  find  this  to 
be  quite  a  challenge.  Not  only  will  you  be  learning  how  to  solve  the  problem,  you 
will  also  be  learning  how  to  write  up  the  solution.  The  necessary  definitions  and 
background  to  understand  a  problem,  as  well  as  a  general  plan  of  attack,  will  always 
be  presented  in  the  text.  It’s  up  to  you  to  spend  the  time  reading,  trying  various  ap¬ 
proaches,  rereading,  and  reapproaching.  You  will  probably  be  spending  more  time 
on  fewer  exercises  than  you  ever  have  before.  While  you  are  now  beyond  the  stage 
of  being  given  steps  to  follow  and  practice,  there  are  general  rules  that  can  assist  you 
in  your  transition  to  doing  higher  mathematics.  Many  people  have  written  about  this 
subject  before.  The  classic  text  on  how  to  approach  a  problem  is  a  wonderful  book 
called  How  to  Solve  It  by  George  Polya,  [84]. 

In  his  text,  Polya  gives  a  list  of  guidelines  for  solving  mathematical  problems.  He 
calls  his  suggestions  “the  list.”  We  have  included  the  original  in  the  Appendix  on 
page  366.  This  list  has  served  as  a  guide  for  several  generations  of  mathematicians, 
and  we  suggest  that  you  let  it  guide  you  as  well.  Here’s  a  closer  look  at  “the  list” 
with  some  21st-century  modifications. 

First.  “Understanding  the  problem.”  Easier  said  than  done,  of  course.  What 
should  you  do?  Make  sure  you  know  what  all  the  words  mean.  You  may  need  to 
look  something  up  in  this  book,  or  you  may  need  to  use  another  book.  Look  at  the 
statement  to  figure  out  carefully  what  you  are  given  and  what  you  are  supposed  to 
figure  out.  If  a  picture  will  help,  draw  it.  Will  you  be  proving  something?  What? 
Will  you  have  to  obtain  an  example?  Of  what?  Check  all  conditions.  Will  you  have 
to  show  that  something  is  false?  Once  you  understand  what  you  have  to  do,  you  can 
move  on  to  the  next  step. 
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1  The  How,  When,  and  Why  of  Mathematics 


Second.  “Devising  a  plan.”  How  will  you  attack  the  problem?  At  this  point,  you 
understand  what  must  be  done  (because  you  have  completed  Step  1).  Have  you  seen 
something  like  it  before?  If  you  haven’t  looked  over  class  notes,  haven’t  read  the 
text,  or  haven’t  done  the  previous  homework  assignments,  the  odds  are  slim  that 
you  have  seen  anything  that  will  be  helpful.  Do  all  that  first.  Look  over  the  text  with 
the  problem  in  mind,  read  over  your  notes  with  the  problem  in  your  head,  look  at 
previous  exercises  and  theorems  that  sound  similar.  Maybe  you  can  use  some  of 
the  ideas  in  the  proof  of  a  theorem,  or  maybe  you  can  use  a  previous  homework 
problem.  Mathematics  builds  on  itself  and  the  problems  in  the  text  will  also.  If  you 
are  truly  stuck,  try  to  answer  a  simpler,  similar  question.  Once  you  decide  on  a 
method  of  approach,  try  it  out. 

Third.  “Carrying  out  the  plan.”  Solve  the  problem.  Look  at  your  solution.  Is  each 
sentence  true?  Sometimes  it  is  difficult  to  catch  an  error  right  after  you  have  “found 
a  solution.”  Put  the  problem  down  and  come  back  to  it  a  few  hours  later.  Is  each 
sentence  still  true? 

Fourth.  “Looking  back.”  Polya  suggests  checking  the  result  and  the  argument,  or 
even  looking  for  a  different  proof.  If  you  are  allowed  (check  with  your  teacher),  one 
really  good  way  to  check  a  proof  is  to  give  it  to  someone  else.  You  can  present  it 
to  friends.  Even  if  they  don’t  understand  a  word  you  are  saying,  sometimes  saying 
it  out  loud  in  a  coherent  manner  will  allow  you  to  recognize  an  error  you  can’t  spot 
when  you  are  reading.  If  you  are  permitted  to  work  together,  switch  proofs  and  ask 
your  partner  for  criticism  of  your  proof. 

When  you  are  convinced  that  your  argument  is  correct,  it  is  time  to  write  up  a 
correct  and  neat  solution  to  the  problem. 

Here  is  an  example  of  the  Polya  method  at  work  in  mathematics;  we  will  decipher 
a  message.  A  cipher  is  a  system  that  is  used  to  hide  the  meaning  of  a  message  by 
replacing  the  letters  of  the  alphabet  by  other  letters  or  symbols. 

Exercise  1.1.  The  following  message  is  encoded  by  a  shift  of  the  alphabet;  that  is, 
every  letter  is  replaced  by  another  one  that  has  been  shifted  n  places  further  down 
the  alphabet.  Once  we  reach  the  end  of  the  alphabet,  we  start  over.  For  instance,  if  n 

were  7,  we  would  make  the  replacements  a  — ►  h,  b  — ►  i,  . . . ,  s  — >  z,  t  — ►  a, Now 

the  exercise:  What  does  the  message  below  say? 

PDEO  AJYKZEJC  WHCKNEPDI  EO  YWHHAZ  W  YWAOWN  YELDAN. 

EP  EO  RANU  AWOU  PK  XNAWG,  NECDP? 

Let’s  use  the  ideas  from  Polya’s  list  to  solve  this.  If  you  have  solved  problems 
like  this  before,  it  might  be  a  better  exercise  for  you  to  try  on  your  own  to  see  how 
this  fits  Polya’s  method  before  you  read  on. 

1.  “Understanding  the  problem.”  Each  sequence  of  letters  with  no  blank  space  be¬ 
tween  the  letters  represents  one  word.  Each  letter  is  shifted  by  the  same  number 
of  places:  namely,  n.  So  n  is  the  unknown  in  this  problem  and  it  is  what  we  need 
to  find.  Once  we  know  the  value  of  n,  we  can  decipher  the  whole  message.  In 
addition,  once  we  know  the  meaning  of  one  letter,  we  can  find  the  value  for  n. 
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2.  “Devising  a  plan.”  A  cipher  text  may  have  weak  points.  What  are  these?  How 
about  the  short  words?  Looking  at  the  short  words,  in  some  sense,  substitutes  an 
easier  problem  for  the  one  we  have. 

3.  “Carrying  out  the  plan.”  The  short  words  are: 

W; 

EO  (which  appears  twice); 

EP; 

PK. 

Try  using  the  most  common  one-  and  two-letter  words.  For  each  guess,  check 
the  beginning  of  the  cipher  text  to  see  if  it  makes  sense.  It  shouldn’t  take  long  for 
you  to  come  up  with  the  message. 

4.  “Looking  back.”  If  your  solution  makes  sense,  then  it  is  highly  unlikely  that  a 
different  replacement  is  also  possible.  So  the  solution  is  (with  high  probability) 
the  only  one. 

Would  there  have  been  other  solution  methods?  Sure.  For  instance,  not  all  letters 
have  the  same  frequency  in  the  English  language.  One  analysis  of  English  texts 
showed  the  letter  e  occurring  most  frequently,  followed  by  (in  this  order)  t,  a,  o, 
i,  n,  s,  h,  and  r.  (See  [99,  p.  19].)  We  could  have  used  this  information  to  guess 
the  assignment  of  letters. 

We  also  could  have  simply  tried  one  value  of  n  after  another  until  the  message 
made  sense. 

Have  you  now  solved  the  problem?  If  you  know  what  the  message  says,  then 
the  answer  to  this  question  is  yes.  Are  you  done?  Unless  you  solved  the  problem 
and  wrote  up  a  clear,  complete  solution,  the  answer  to  this  second  question  is  no. 
A  solution  consists  of  a  report  that  tells  the  reader  how  you  solved  the  problem  and 
what  the  answer  is.  This  needs  to  be  done  in  clear  English  sentences.  As  you  write 
up  your  solution,  try  to  keep  the  reader  in  mind.  You  should  explain  things  clearly 
and  logically,  so  that  the  reader  doesn’t  have  to  spend  time  filling  in  gaps.  Q 

We  now  move  on  to  a  very  different  kind  of  example.  Consider  the  set  of  points  in 
three-space.  In  case  you  haven’t  seen  this  before,  these  points  are  easily  described. 
We  take  the  familiar  xy-plane,  and  place  it  parallel  to  the  floor.  The  z-axis  is  the 
vertical  line  perpendicular  to  the  xy-plane  and  passing  through  the  origin  of  the  xy- 
plane  (see  Figure  1.1).  We’ll  review  the  important  concepts  before  we  begin  our 
example. 

To  locate  a  point,  we  will  give  three  coordinates.  The  first  coordinate  is  the  x- 
coordinate  and  tells  us  the  number  of  units  to  walk  in  the  x-direction.  The  second 
is  the  y-coordinate,  telling  us  how  to  move  in  the  y-direction,  and  the  third  is  the 
z-coordinate,  telling  us  how  far,  up  or  down,  to  move.  So  a  point  in  three-space  is 
denoted  by  (x, y, z) .  It  is  important  to  make  sure  you  understand  this.  Try  to  think 
of  how  you  would  plot  points.  The  point  (1,0,0)  (plotted  in  Figure  1.2)  would  ap¬ 
pear  one  unit  in  the  positive  direction  on  the  x-axis  (since  it  doesn’t  move  in  the 
y-direction  or  z-direction  at  all).  The  point  (—1,1,0)  would  appear  in  the  xy-plane. 
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one  unit  back  on  the  x-axis  and  one  unit  in  the  positive  y-direction.  Finally  the  point 
(2, —  1,3)  is  plotted  in  Figure  1.2. 

Let’s  go  a  bit  further  here.  In  two-space,  what  was  x  =  0?  Since  y  does  not  appear 
in  that  equation,  it  is  unrestricted  and  can  be  any  real  number.  That’s  why  x  =  0  in 
two-space  is  the  y-axis.  What  is  x  =  3?  It  is  a  line  parallel  to  the  v-axis  through  the 
point  (3,0).  So,  let’s  try  to  generalize  this  to  the  situation  in  three-space.  What’s  the 
plane  z  =  0?  Recall  that  if  a  variable  doesn’t  appear,  then  it  may  assume  any  value. 
So  this  means  that  z  is  fixed  at  0  while  x  can  take  any  value,  as  can  y.  Thus,  the  plane 
z  =  0  is  the  xy-plane.  Similarly,  the  yz -plane  is  the  plane  x  =  0  and  the  xz-plane  is  the 
plane  y  =  0.  These  three  planes  are  called  the  coordinate  planes.  What’s  the  plane 
Z  =  3?  x  =  2?  y  =  yo?  There’s  plenty  to  think  about  here,  but  let’s  start  by  asking 
what  the  distance  is  between  two  points  in  three-space. 

Example  1.2.  Given  two  points  (xo,yo,Zo)  and  (xi,yi,Zi)  in  three-space,  what  is  the 
distance  between  the  two  points? 


z 


Fig.  1.2 
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We  follow  Polya’s  method  to  find  the  solution. 

1.  “Understanding  the  problem.”  Before  we  begin,  we  make  sure  we  really  under¬ 
stand  the  meaning  of  each  word  and  symbol  above.  We  spent  the  last  few  para¬ 
graphs  making  sure  we  all  understand  the  symbols,  and  all  the  words  are  familiar 
ones  that  appear  in  a  standard  English  dictionary.  But,  wait — has  “distance  be¬ 
tween  two  points”  really  been  defined?  We  need  to  be  sure  that  everyone  means 
the  same  thing  by  this.  The  distance  between  these  two  arbitrary  points  would 
mean  the  length  of  the  straight  line  segment  joining  the  two  points.  That’s  what 
we  need  to  find.  What  were  we  given?  Two  points  and  their  coordinates. 

2.  “Devising  a  plan.”  How  do  we  solve  something  like  this?  We  haven’t  covered 
anything  yet,  so  what  can  the  authors  be  thinking?  If  you  have  no  idea  how  to  get 
started,  try  thinking  about  finding  the  distance  between  two  specific  points.  Of 
course,  (and  this  is  very  important)  this  won’t  give  us  a  general  formula  because 
it  is  much  too  specific,  but  maybe  we’ll  get  some  ideas. 

So  what’s  the  distance  between  the  two  points  (1,0,0)  and  (—1,0,0)?  That  ques¬ 
tion  is  easier  to  answer — it’s  two.  What’s  the  distance  between  (1,1,0)  and 
(— 1,— 2,0)?  This  seems  to  be  just  the  distance  between  two  points  in  the  fa¬ 
miliar  xy-plane.  We  saw  a  formula  for  that  at  some  point.  It  was  obtained  using 
the  Pythagorean  Theorem.  What  was  it?  If  you  can’t  recall  the  formula,  look  it 
up  or  (better,  yet)  try  to  derive  it  again. 

Our  reasoning  now  brings  us  to  a  simpler,  similar  question.  As  you  recall,  this  is 
precisely  where  Polya  suggested  we  look  for  a  plan.  So  far,  it  seems  we  can  find 
the  distance  between  two  points  as  long  as  they  lie  in  a  plane  parallel  to  one  of 
the  coordinate  planes.  But  in  this  problem,  if  we  look  at  the  two  points,  they  need 
not  lie  in  such  a  plane.  We  can  try  to  insert  a  third  point  that  helps  us  to  reduce 
the  problem  to  one  we  can  already  solve.  Which  point?  A  picture  will  help  here, 
so  we  draw  one  in  Figure  1.3. 

We  see  that  (xo,yo,2o)  and  (Alonzo)  lie  in  the  plane  z  =  z0,  while  (xi,yi,zo) 
and  (xi  ,yi,zi)  lie  on  the  same  vertical  line,  in  the  intersection  of  the  two  planes. 
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x  =  x\  and  y=y i .  We  “devise  our  plan”  using  these  three  points.  Can  we  get  the 
distance  we  are  looking  for  from  these  three  points?  Look  at  Figure  1.3  and  see 
if  you  can  guess  the  rest  before  going  on  to  Step  3.  You  probably  noticed  that  the 
vertical  line  makes  a  right  angle  with  every  line  in  the  plane  z  =  zo-  This  should 
suggest  something  to  you — something  like  the  Pythagorean  Theorem. 

3.  “Carrying  out  the  plan.”  This  is  the  only  thing  the  reader  will  see.  Everything  that 
preceded  this  was  to  assist  us  in  obtaining  this  solution.  That  means  the  reader 
doesn’t  know  what  the  points  are;  we  have  to  tell  him  or  her  that.  We  should 
make  sure  we  say  why  a  sentence  follows  from  the  previous  one  and  we  should 
use  equal  signs  between  equal  objects.  When  we  think  we  are  done,  we  should 
tell  the  reader  that  too. 

Solution.  Let  P  =  (xo , y() ■  zo )  and  Q  =  (*i,yi,zi)  be  two  points  in  space.  We  claim 
that  the  distance  between  these  two  points,  denoted  by  d(P,  Q),  is 


Proof.  We  introduce  a  third  point  with  coordinates  R  =  (xi,yi,zo)-  Since  (xo,yo,zo) 
and  (xi  ,yi  ,zo)  both  lie  in  the  plane  z  =  zo,  we  can  use  the  distance  formula  for  two 
points  in  a  plane  to  find  the  distance  between  them.  Thus,  the  distance  is  given  by 


Now  look  at  the  distance  between  the  two  points  (xi,yi,zo)  and  (xi,yi,zi).  Since 
these  points  lie  on  the  same  vertical  line,  the  distance  is  given  by 


d{R,Q)  =  |zo-zi|  ■ 


Now,  the  distance  we  are  looking  for  is  the  length  of  the  line  segment  PQ,  which  is 
the  hypotenuse  of  the  right  triangle  PQR  (see  Figure  1.4). 
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This  is  a  right  triangle,  so  we  can  obtain  the  length  using  the  Pythagorean  Theo¬ 
rem.  So,  we  get 


d(P,  Q)  =  yj d(P,R)2  +  d(R,Q)2  . 

Substituting  in  what  we  found  above,  we  obtain 

d(P,Q)  =  yVo  -  XI )2  +  (yo  -  y  1  )2  +  (zo  -  zi  )2  • 

This  completes  the  proof. 

4.  “Looking  back.”  What  we  have  presented  is  our  version  of  the  proof.  You  may 
find  that  you  need  to  include  more  details.  By  all  means,  go  ahead.  If  you  had 
to  stop  and  say,  “where  did  that  come  from?”  make  sure  you  answer  yourself. 
Write  it  in  the  text  (you  aren’t  going  to  sell  this  book  back  anyway,  right?),  or 
keep  a  notebook  of  “proofs  with  commentary.”  Note  that  though  we  used  pictures 
to  illustrate  the  ideas  in  our  argument,  a  picture  will  not,  in  general,  substitute  for 
a  proof.  However,  it  can  really  clarify  an  idea.  Don’t  rely  on  a  picture,  but  don’t 
be  afraid  to  use  one  either.  O 


Solutions  to  Exercises 

Solution  (1.1).  We  are  given  that  this  code  was  created  through  a  shift  of  the  alpha¬ 
bet.  Thus  once  we  determine  one  letter,  the  other  letters  are  easily  found.  Since  we 
have  a  one-letter  word,  we’ll  start  with  it.  Thus  “W”  must  represent  the  letter  “I”  or 
the  letter  “A.”  Checking  both  shifts  of  the  alphabet 


(w  - 

IjX  - 

-+J,Y- 

-> K,Z - 

-» L,A  - 

->  M,B  - 

N,C - 

-»  0,  etc. 

w- 

>  A,X  - 

+ b,y - 

■*C,Z~ 

-+ D,A  - 

->  E,B  - 

+  F,C- 

»  G ,  etc.) 

we  find  that  if  “W”  represents  the  letter  “I,”  then  “E”  must  represent  the  letter  “Q.” 
The  fact  that  “EO”  appears  as  a  word  and  “O”  would  represent  the  letter  “A”  in 
our  coded  text  implies  that  “EO”  would  be  the  word  “QA,”  which  is  an  interesting 
combination  of  letters,  but  hardly  a  word.  Thus,  “W”  cannot  represent  the  letter  “I” 
and  therefore  “W”  represents  “A.” 

Using  the  shift  described  above  and  replacing  the  corresponding  letters,  we  find 
that  the  code  says  the  following. 

“THIS  ENCODING  ALGORITHM  IS  CALLED  A  CAESAR  CIPHER.  IT  IS 

VERY  EASY  TO  BREAK,  RIGHT?” 

In  fact,  the  Caesar  cipher  is  quite  easy  to  break.  If  this  interests  you,  a  very 
readable  history  of  coding  theory  is  presented  by  S.  Singh  in  The  Code  Book ,  [99]. 
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Gyorgy  Polya  (1887-1985),  referred  to  as  George  Polya  in  his  later  years,  was  bom 
and  raised  in  Hungary.  He  studied  in  Vienna  and  in  Budapest,  where  he  received  his 
doctorate  in  1912.  One  of  his  influential  teachers  was  Leopold  Fejer.  In  his  book  [85, 
p.  39],  Polya  refers  to  Fejer  as  “an  inspiring  teacher  who  had  a  great  deal  of  influence 
on  Hungarian  mathematicians  of  the  time.”  The  two  primary  places  where  Polya 
taught  were  the  Eidgenossische  Technische  Hochschule  (ETH)  in  Zurich,  Switzer¬ 
land,  and  Stanford  University  in  Palo  Alto,  California. 

Though  Polya’s  mother  tongue  was  Hungarian,  he  worked  in  the  Swiss-German- 
speaking  part  of  Switzerland  and  he  spoke  French  with  his  wife  from  Neuchatel,  a 
city  in  the  French-speaking  part  of  Switzerland.  In  school  he  also  learned  Latin  and 
Greek.  (See  [85,  p.  11].)  Polya  later  emigrated  to  the  United  States  where  he  taught 
and  lectured  in  English.  He  published  mathematical  papers  in  Hungarian,  German, 
French,  English,  Italian,  and  Danish. 

Polya  contributed  to  original  research  in  probability,  geometry,  number  theory, 
real  and  complex  analysis,  graph  theory,  combinatorics,  and  mathematical  physics. 
His  name  is  connected  to  many  mathematical  ideas  and  constructions.  To  name  just  a 
few  of  his  achievements,  we  mention  that  in  probability  there  is  a  Polya  distribution 
and  he  is  credited  with  introducing  the  idea  and  the  term  of  “random  walk.”  But 
Polya  was  not  only  recognized  as  an  excellent  scholar  of  mathematics,  he  was  also 
an  excellent  teacher  of  mathematics.  His  heuristic  approach  to  problem  solving  is 
outlined  in  How  to  Solve  It.  This  book  had  a  profound  influence  on  the  teaching 
of  mathematics.  It  has  sold  over  one  million  copies  and  is  translated  into  over  20 
languages.  Records  kept  by  the  ETH  in  Zurich  show  that  Polya  was  the  advisor  of 
14  thesis  students  there  and,  according  to  [78],  he  was  the  advisor  of  9  more  students 
at  Stanford. 

The  Mathematical  Association  of  America  (MAA)  gives  an  annual  Polya  award. 
According  to  the  MAA  website,  “This  award,  established  in  1976,  is  named  after 
the  renowned  teacher  and  writer,  and  is  given  for  articles  of  expository  excellence 
published  in  the  College  Mathematics  Journal .” 

To  leant  more  about  George  Polya  and  his  approach  to  problem  solving,  we  rec¬ 
ommend  reading  his  book  How  to  Solve  It,  [84],  the  picture  book  [85]  (which  con¬ 
tains  a  short  biography),  or  consulting  the  more  in-depth  account  of  Polya’s  life  [4], 
written  by  his  former  student  at  Stanford.  The  article  [110]  is  based  on  interviews 
with  Polya  and  appeared  in  an  issue  of  Mathematics  Magazine  entirely  devoted  to 
Polya  and  his  work. 


Problems 

Problem  1.1.  Here  is  a  problem  intended  to  help  you  work  through  “the  list.”  After 
this,  you  are  on  your  own. 
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Find  a  word  (written  in  standard  capital  letters)  that  is  unchanged  when  reflected 
in  a  horizontal  line  and  in  a  vertical  line.  The  word  must  appear  in  a  dictionary  (in  a 
language  of  your  choice)  in  order  to  be  a  valid  solution. 

1.  “Understanding  the  problem."  We  need  to  find  a  word.  We  are  given  information 
about  the  letters  that  make  up  this  word.  There  are  two  conditions:  Two  different 
reflections  should  not  alter  the  word. 

Try  these  two  reflections  on  a  word,  say  on  SOLUTION,  to  make  sure  you  un¬ 
derstand  the  problem. 

2.  “Devising  a  plan.”  We  have  to  find  the  connection  between  what  we  are  given 
and  what  we  have  to  find. 

Which  letters  of  the  alphabet  satisfy  each  of  the  two  conditions? 

Both  conditions? 

Find  a  word  that  is  not  changed  if  it  is  reflected  in  a  horizontal  line. 

Find  a  word  that  is  not  changed  if  it  is  reflected  in  a  vertical  line. 

Formulate  the  exact  conditions  for  this  exercise;  that  is,  state  the  letters  that  can 
be  used  and  how  they  must  be  arranged. 

3.  “Carrying  out  the  plan.”  Find  a  word  that  satisfies  the  conditions  given  above. 

4.  “Looking  back.”  Are  there  other  solutions? 

Problem  1.2.  Find  a  word  (written  in  standard  capital  letters)  that  reads  the  same 
forward  and  backward  and  is  still  the  same  forward  and  backward  when  rotated 
around  its  center  180°.  Your  solution  needs  to  appear  in  a  standard  dictionary  of 
some  language. 

Problem  1.3.  Solve  the  following  anagrams.  The  first  three  are  places  (in  the  ge¬ 
ographical  sense),  and  the  fourth  is  a  place  in  which  you  might  live.  All  can  be 

rearranged  to  form  a  single  word. 

(a)  NOVA  CURVE; 

(b)  NINE  SLAP  NAVY; 

(c)  I  HELD  A  HIP  PAL; 

(d)  DIRTY  ROOM. 

Note:  You  may  have  to  find  out  exactly  what  an  anagram  is.  This  is  part  of  Polya’s 
first  point  on  the  list. 

Problem  1.4.  Suppose  n  teams  play  in  a  single  game  elimination  tournament.  How 
many  games  are  played? 

An  example  of  such  tournaments  are  the  various  categories  of  the  U.  S.  Open 
tennis  tournament;  for  example,  women’s  singles. 

Note:  Pay  special  attention  to  the  first  entry  of  Polya’s  list:  “Is  it  possible  to 
satisfy  the  condition?” 

Problem  1.5.  Suppose  you  are  all  alone  in  a  strange  house.  There  are  seven  identical 
closed  doors.  The  bathroom  is  behind  exactly  one  of  them.  Is  it  more  likely,  less 
likely,  or  equally  likely  that  you  find  the  bathroom  on  the  first  try  than  on  the  third 
try?  Why? 
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Problem  1.6.  The  following  message  is  encoded  using  a  shifted  alphabet  just  as  in 
Exercise  1.1.  (Of  course,  the  shift  number  n  is  not  the  same  as  in  the  exercise!)  What 
does  the  message  say? 

RDSXCVIWTDGNXHUJCLTLXAAATPGCBDGTPQDJIXIAPITG 

Problem  1.7.  Give  a  detailed  description  of  all  points  in  three-space  that  are  equidis¬ 
tant  from  the  jc-axis  and  the  yz-plane.  Once  you  decide  on  the  answer,  write  the 
solution  up  carefully.  Pay  particular  attention  to  your  notation. 

Problem  1.8.  The  following  is  a  classic  problem  in  mathematics.  Though  there  are 
many  variations  of  this  problem,  the  standard  one  is  the  following. 

You  are  given  12  coins  that  appear  to  be  identical.  However,  one  of  the  coins  is 
counterfeit,  and  the  weight  of  this  coin  is  slightly  different  than  that  of  the  other  1 1 . 
Using  only  a  two-pan  balance,  what  is  the  smallest  number  of  weighings  you  would 
need  to  find  the  counterfeit  coin?  (Think  about  a  simpler,  similar  problem.) 

(See  I.  Peterson’s  website  [82]  for  a  discussion  of  this  problem.) 

Problem  1.9.  Let  n  be  an  odd  integer.  Prove  that  n3  —  n  is  divisible  by  24. 

The  following  two  problems  are  only  appropriate  if  you  took  at  least  two  semesters 
of  calculus.  Though  you  may  have  worked  these  before,  the  idea  is  to  work  them 
again  paying  close  attention  to  the  final  presentation.  Make  sure  you  define  all  vari¬ 
ables.  Use  complete  sentences,  with  proper  punctuation. 

Problem  1.10.  Find  the  volume  of  a  spherical  cap  if  the  height  is  2  m  and  the  radius 
of  the  rim  of  the  cap  is  5  m. 

Problem  1.11.  We  have  two  circular  right  cylinders  of  radius  1  each.  The  axes  of 
the  two  cylinders  intersect  at  a  right  angle.  Find  the  volume  of  the  solid  that  both 
cylinders  have  in  common. 

Problem  1.12.  Shlomo  Sureshot  started  the  basketball  season  with  a  free  throw 
shooting  percentage  of  below  75%.  By  the  end  of  the  season  he  brought  it  up  to 
above  75%.  Must  there  have  been  a  time  in  the  season  (after  a  free  throw  attempt) 
when  his  free  throw  percentage  was  exactly  75%? 

After  having  solved  this  problem  and  looking  back  at  your  solution,  are  there 
questions  that  you  would  like  to  answer?  Can  you  answer  them? 

Problem  1.13.  An  old  Spanish  treasure  is  hidden  away  in  a  magical  box  with  integer 
dimensions;  that  is,  height,  width,  and  length  are  all  of  integer  values  if  measured  in 
“braza,”  a  unit  of  length.  The  volume  of  the  box  is  40  braza3.  Though  knowing  the 
sum  of  the  dimensions  will  not  completely  determine  them,  knowing  (in  addition) 
that  the  square  front  face  is  painted  red  will.  What  are  the  dimensions  of  the  box? 
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Tips  on  Doing  Homework 

Your  instructor  will  probably  ask  you  to  work  many  of  the  exercises  and  problems 

in  this  text.  If  there  is  one  thing  mathematicians  agree  on,  it  is  that  you  learn  math¬ 
ematics  by  doing  it.  Here  are  some  tips  on  how  to  get  started. 

•  Make  sure  you  know  what  the  rules  are.  Some  instructors  do  not  want  you  to  get 
help  from  someone  else.  Other  instructors  encourage  working  together  in  groups. 
Ask,  if  you  are  not  clear  about  the  policy. 

•  If  you  are  permitted  to  work  together,  form  a  study  group.  A  small  group  of  two 
to  four  people  usually  works  best.  Get  together  on  a  regular  basis  and  discuss  the 
assigned  problems. 

•  Read  the  questions  carefully.  If  there  is  a  term  that  you  do  not  know,  look  it  up. 

•  Before  you  get  started,  read  over  the  text  and  the  notes  from  class,  paying  partic¬ 
ular  attention  to  definitions,  theorems,  and  previous  exercises.  It  isn’t  unusual  to 
spend  several  hours  on  a  single  problem  at  this  point.  Doing  mathematics  means 
pondering  a  problem  for  hours,  days,  weeks,  even  years  (though  we  have  tried 
not  to  pose  problems  that  will  take  you  years  to  solve).  Working  two  hours  on 
one  problem,  thinking  about  it  as  you  go  through  your  day  and  then  spending 
another  two  hours  on  it  the  next  day  is  fairly  common  practice  for  students  at  this 
level. 

•  Once  you  have  read  over  the  text,  looked  over  the  relevant  definitions,  worked 
through  the  examples,  and  tried  to  solve  the  problem,  you  will  be  well  on  your 
way  toward  understanding  the  problem.  If  you  can’t  get  started,  at  least  you  will 
know  which  questions  to  ask.  Seek  help  from  your  instructor  or  other  students  (if 
your  instructor  allows  this). 

•  Once  you  have  a  solution  to  a  problem,  look  at  it  critically.  Check  that  it  is  correct. 
Put  it  down.  Come  back  to  it  later.  Do  you  still  understand  everything?  Is  it  still 
correct?  (As  you  can  imagine,  this  is  very  important.)  Can  you  simplify  it?  If  you 
work  with  someone  else,  have  them  read  it  over.  Never  hand  in  your  first  draft  of 
a  solution  to  a  problem. 

•  Writing  a  solution  means  convincing  a  reader  that  the  result  is  correct.  There 
can  be  no  gaps  or  errors.  Explain  each  step — don’t  assume  that  the  reader  knows 
what  you  are  thinking.  Keep  a  reader  in  mind  as  you  write,  and  remember  that  the 
instructor  or  anyone  else  who  already  knows  the  solution  is  not  really  your  target 
audience.  Though  that  may  be  the  person  for  whom  the  solution  is  intended,  it  is 
your  job  to  convince  the  reader  that  each  step  in  your  solution  is  correct.  Perhaps 
a  better  audience  to  keep  in  mind  is  someone  who  knows  the  material  from  the 
class,  but  not  the  solution  to  the  problem. 

•  Write  up  your  final  solution  very  carefully  and  neatly.  The  reader  shouldn’t  find 
him-  or  herself  proving  things  for  you — you  should  do  that  for  him  or  her.  Staple 
pages  together  so  that  the  reader  may  have  the  pleasure  of  reading  your  proof  in 
the  correct  order  and  its  entirety. 


Chapter  2 

Logically  Speaking 


“I  know  what  you’re  thinking  about,”  said  Tweedledum;  “but  it  isn’t  so,  nohow.”  “Contrari¬ 
wise,”  continued  Tweedledee,  “if  it  was  so,  it  might  be;  and  if  it  were  so,  it  would  be;  but  as 

it  isn’t,  it  ain’t.  That’s  logic.” — Lewis  Carroll,  [17,  p.  139] 

Suppose  your  friend  tells  you  that  Mr.  Hamburger  is  German  or  Swiss.  You  hap¬ 
pen  to  know  that  Mr.  Hamburger  is  not  Swiss.  Using  your  powers  of  reasoning,  you 
decide  that  Mr.  Hamburger  is  German.  Note  that  this  argument  can  be  generalized, 
because  it  doesn’t  really  depend  on  Mr.  Hamburger  being  Swiss  or  German.  If  your 
friend  said  that  “A  or  B  is  true”  and  you  happened  to  know  that  “B  is  not  true,” 
you  would  conclude  that  “A  is  true.”  This  is  an  example  of  a  valid  argument.  Now 
suppose  your  friend  tells  you  that  Mr.  French  eats  only  pickles  on  Wednesday,  and 
only  chocolate  on  Monday.  You  know  that  Mr.  French  is  eating  chocolate  that  day. 
Now  what  can  you  say?  While  you  may  conclude  that  Mr.  French  has  odd  eating 
habits,  you  would  not  have  used  a  logically  valid  argument  to  do  so.  In  this  example, 
there  is  really  only  one  thing  you  can  conclude.  We’ll  return  to  this  at  the  end  of  this 
chapter. 

In  order  to  understand  an  argument,  we  must  be  able  to  read  and  comprehend 
the  sentences  that  compose  it.  We  need  to  be  able  to  tell  whether  the  sentences  in 
our  argument  are  true  or  false,  and  whether  they  follow  logically  from  the  previous 
ones.  So  now  for  a  definition.  A  statement  is  a  sentence  that  is  true  or  false,  but  not 
both.  “Two  is  not  a  prime  number”  is  an  example  of  a  (false)  statement.  “Do  you 
love  me?”  is  not  a  statement.  Below  are  some  examples  and  some  nonexamples  of 
statements.  These  will  be  your  first  examples  of  nonexamples. 

Exercise  2.1.  Which  of  the  sentences  below  are  statements  and  which  are  not? 

(a)  It  is  raining  outside. 

(b)  The  professor  of  this  class  is  a  woman. 

(c)  Two  plus  two  is  five. 

(d)  Y  +  6  =  0. 

(e)  Seven  is  a  prime  number. 

(f)  All  odd  numbers  are  prime. 


U.  Daepp  and  P.  Gorkin,  Reading,  Writing,  and  Proving:  A  Closer  Look  at  Mathematics , 
Undergraduate  Texts  in  Mathematics,  DOI  10. 1007/978-1 -44 19-9479-0  2, 
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(g)  This  sentence  is  false. 


o 


Because  English  usage  and  mathematical  usage  may  differ  slightly,  we  must  be 
certain  that  we  understand  our  statements  before  we  construct  arguments.  We  now 
carefully  study  the  truth  or  falsity  of  statements.  Our  treatment  is  brief.  (See  [69]  for 
a  more  detailed  study  of  mathematical  logic.) 

The  rules  of  logic  that  we  present  in  this  chapter  should  work  for  all  statements, 
and  not  just  particular  ones.  For  this  reason,  we  introduce  letters  such  as  P,Q,R,  or 
S  to  represent  statements.  Thus  P  will  have  two  possible  truth  values:  true,  denoted 
T,  or  false,  denoted  F.  We  can  negate  P  or  combine  it  with  Q  by  saying  things  like: 

Not  P. 

P  and  Q  . 

P  or  Q. 

If  P,  then  Q. 

P  if  and  only  if  Q. 

Such  symbolic  sentences  will  be  called  statement  forms.  A  precise  definition  of 
statement  form  will  be  given  once  we  have  precise  definitions  of  the  connectives 
“not,”  “and,”  “or,”  “if  . . .,  then  . . and  “if  and  only  if.” 

In  the  English  language  we  might  say 

It  is  raining. 

It  is  not  raining. 

If  it  is  raining,  the  sky  is  gray. 

It  is  raining  or  it  is  snowing. 

It  is  cold  and  it  is  snowing. 

It  is  snowing  if  and  only  if  it  is  cold. 

Let’s  start  with  the  simplest  case.  Suppose  your  teacher  says,  “This  book  has  a 
blue  cover.”  Taking  a  quick  glance  at  the  cover,  you  can  decide  on  the  truth  value  of 
that  statement;  namely,  that  it  is  false.  In  order  to  have  a  true  statement,  you  could 
say,  “This  book  does  not  have  a  blue  cover.”  If  we  have  a  statement  form  P,  the 
negation  of  P  is  the  statement  form  “not  PP  Under  what  circumstances  should  the 
negation  of  P  be  true  or  false?  We  will  always  use  the  notation  -P  for  “not  PP  If 
P  is  true,  then  —P  should  be  false.  If  P  is  false,  then  -P  should  be  true.  We  can 
summarize  all  the  possibilities  in  a  truth  table  as  follows: 


P 

Y 

F 


~F~ 

T 


What  about  combining  two  statement  forms,  P  and  Q,  into  one  statement  form 
as  “P  or  2”?  In  this  sentence,  it  is  particularly  important  to  distinguish  between 
mathematical  usage  of  the  word  “or”  and  everyday  speech.  For  example,  if  we  say, 
“You  can  have  cake  or  ice  cream,”  it  could  be  that  you  can  have  both.  If  we  say,  “The 
door  is  open  or  closed,”  it  cannot  be  that  the  door  is  both  open  and  closed.  English 
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statements  involving  the  word  “or”  are  often  ambiguous;  in  mathematics,  ambiguity 
is  generally  frowned  upon.  The  statement  form  “P  or  Q”  is  called  a  disjunction  and 
is  denoted  P  V  Q.  In  mathematics,  a  disjunction  is  true  when  P  alone  is  true,  Q  alone 
is  true,  or  both  P  and  Q  are  true.  So  in  mathematics,  you  can  always  have  your  cake 
and  ice  cream. 


Exercise  2.2.  Complete  the  truth  table  for  PVQ. 


p 

Q 

PVQ 

T 

T 

T 

F 

F 

T 

F 

F 

The  statement  form  “P  and  Q”  is  called  a  conjunction  and  is  denoted  P  A  Q.  We 
will  have  you  fill  in  the  truth  table  for  “P  and  Q”  below.  It  should  be  clear  that  this 
will  be  true  when  both  P  and  Q  are  true,  and  false  otherwise. 


Exercise  2.3.  Complete  this  truth  table. 


P 

Q 

PAQ 

T 

T 

T 

F 

F 

T 

F 

F 

Now  consider  the  statement  form  “If  P,  then  Q.”  This  statement  form  is  called 
an  implication  and  is  often  stated  as  “P  implies  Q'  and  written  P  — >  (Q.  (Note  that 
though  English  usage  of  the  word  “implies”  may  suggest  a  relationship  between 
P  and  Q,  our  analysis  of  truth  values  has  assumed  no  connection  at  all  between  P 
and  Q.)  There  are  equivalent  ways  of  stating  an  implication,  and  some  will  require 
careful  thinking  on  the  reader’s  part.  Remember  as  you  read  on  that  “If  P,  then  Q" 
may  also  be  stated  as 

Q  if  P. 

P  is  sufficient  for  Q  (meaning  P  is  enough  to  make  Q  happen). 

Q  is  necessary  for  P  (if  P  happened,  then  Q  must  have  happened). 

P  only  if  Q  (same  as  above;  if  P  happened,  then  Q  must  have  happened). 

Q  whenever  P. 

The  statement  form  P  in  each  of  these  formulations  is  called  the  antecedent,  and 
Q  is  called  the  conclusion.  Under  what  conditions  is  an  implication  true?  false?  Let’s 
begin  with  an  example  you  are  all  familiar  with.  Suppose  we  say  to  our  son, 

“If  you  clean  your  room,  then  you  can  go  to  Henry’s  house.” 
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Under  what  conditions  would  he  feel  that  we  had  lied?  In  the  example,  the  an¬ 
tecedent,  P,  is  “you  clean  your  room.”  The  conclusion,  Q,  is  “you  can  go  to  Henry’s 
house.”  Well,  if  our  son  cleans  his  room  and  we  let  him  go  to  Henry’s,  everybody 
is  happy.  That  implication  should  be  true.  So,  if  P  is  true  and  Q  is  tme,  the  whole 
statement  should  be  tme.  Also,  it  should  be  as  clear  to  you  as  it  will  be  to  our  son, 
that  if  he  cleans  his  room  and  we  do  not  let  him  go  to  Henry’s,  we  lied.  So,  if  P  is 
true,  and  Q  is  false,  the  implication  should  be  false.  Now  what  if  he  doesn’t  clean 
his  room?  We  never  discussed  this  possibility.  So,  no  matter  what  we  decide  here, 
we  have  not  lied.  In  this  situation,  the  statement  is  not  false;  hence  we  consider  it  to 
be  true.  So,  if  P  is  false,  no  matter  what  the  truth  value  is  of  the  conclusion,  we  will 
consider  the  implication  to  be  true. 

Summarizing  this  discussion,  the  only  way  that  the  implication  “If  P,  then  Q” 
can  be  false  is  if  P  is  true  and  Q  is  false.  In  the  exercise  below  you  will  sum  up  this 
discussion  in  the  form  of  a  truth  table. 


Exercise  2.4.  Complete  this  truth  table. 


P 

Y 

T 

F 

F 


Q 

T 

F 

T 

F 


P 


Q 


O 


It  is  often  helpful  to  rephrase  a  statement,  making  sure  that  you  maintain  the 
same  tme  and  false  values.  The  statement  form  “P  if  and  only  if  Q"  is  called  an 
equivalence,  and  we  will  write  this  as  P  <->  Q.  This  is  the  same  statement  form  as 
“(P  only  if  Q)  and  (P  if  Q).”  In  view  of  the  discussion  above,  we  see  that  this  is  also 
(p  -  Q)  A  {Q  — -  P) .  Thus  the  truth  table  for  the  equivalence  is 


P 

Y 

T 

F 

F 


T 

F 

T 

F 


T 

F 

T 

T 


T 

T 

F 

T 


P 


P  Q 


T 

F 

F 

T 


Look  down  the  final  column  and  you’ll  see  that  the  equivalence  is  tme  precisely 
when  P  and  Q  are  both  tme  or  both  false. 

The  statement  form  “If  P,  then  Q”  is  also  written  as  P  =>  Q,  and  “P  if  and  only  if 
Q”  might  be  written  as  P  •£=>  Q  or  “P  iff  Q.” 

Having  studied  the  connectives,  we  are  ready  for  our  definition  of  a  statement 
form.  A  statement  form  is  a  letter  representing  an  unspecified  statement  or  an  ex¬ 
pression  built  from  such  letters  using  connectives.  Statement  forms  can  be  quite 
complicated.  Assigning  tmth  values  to  them  can  be  done  in  a  step-by-step  fashion. 
The  exercise  below  illustrates  this. 


Exercise  2.5.  Find  the  tmth  table  for  the  statement  form 


2  Logically  Speaking 


17 


(P-H2V.R))A(tfVg). 

To  solve  this  you  must  break  the  complicated  form  ( P  — -  (~Q  V  *))  A  (R  V  Q)  into 
simpler  parts.  Once  you  have  done  this,  you  should  find  the  truth  value  of  each  of 
the  parts  using  the  truth  values  of  P.  Q.  and  R.  Q 

Now  consider  the  two  statement  forms  ->(P  V  Q)  and  -P  A  —Q.  In  the  next  exer¬ 
cise,  you  will  find  the  truth  table  for  each  of  these  expressions  and  compare  them. 

Exercise  2.6.  Write  out  the  truth  tables  for  -i(PVQ),  ~^P  A  ->Q,  and  (—(A  V  Q))  <-> 
(~iP  A  —Q) .  What  can  you  conclude?  O 


A  statement  form  for  which  the  final  column  in  the  truth  table  consists  of  all  7”s 
is  called  a  tautology.  A  statement  form  for  which  the  final  column  is  all  F’ s  is  called 
a  contradiction.  Two  statement  forms,  P  and  Q,  are  said  to  be  (logically)  equivalent 
if  P  <->  Q  is  a  tautology,  and  two  statements  are  equivalent  if  they  can  be  obtained 
from  two  equivalent  statement  forms  by  consistently  replacing  the  letters  by  English 
statements. 

In  view  of  Exercise  2.6,  we  see  that  -*(PV  Q)  and  -P  A  ~Q  are  equivalent  state¬ 
ment  forms.  Thus  the  statement  “It  is  not  the  case  that  Rachel  or  Leah  won  the  race” 
is  equivalent  to  “Rachel  did  not  win  the  race  and  Leah  did  not  win  the  race.”  (Why?) 

In  Exercise  2.6  you  noticed  that  the  two  statement  forms  -'(P  V<2)  and  -P  A  —Q 
have  the  same  truth  table,  and  (->(P  V  Q))  <->  (-P  A  ~^Q)  is  a  tautology.  This  isn’t 
something  that  happens  in  this  one  particular  example.  Whenever  we  notice  that  a 
statement  is  always  true,  we  can  state  that  fact  as  a  theorem.  Of  course,  we  will 
need  to  give  a  conclusive  argument  for  the  statement’s  truth,  and  this  is  called  a 
proof  of  the  theorem.  We’ll  present  a  theorem  and  a  proof  below,  but  remember: 
The  statement  forms,  P  and  Q,  might  very  well  be  complicated  constructions  with 
many  connectives.  For  instance,  P  could  be  R  — >  (S  V  (—7  AR)).  In  fact,  the  theorem 
we  present  below  is  most  interesting  when  P  is  complicated! 

Theorem  2.7.  Two  statement  forms  P  and  Q  are  equivalent  if  and  only  if  they  have 
the  same  truth  table. 


Proof  Consider  the  truth  table  for  the  equivalence  P  <->  Q: 


P 

Y 

T 

F 

F 


Q 

T 

F 

T 

F 


P  Q 

T 

F 

F 

T 


We  know  that  P  <->  Q  is  a  tautology  if  and  only  if  P  <->  Q  has  the  truth  value  T  (in 
the  table  above,  this  is  row  1  and  row  4).  Finally,  P  <->  Q  has  the  truth  value  T  if  and 
only  if  P  and  Q  have  the  same  truth  value.  Since  P  and  Q  are  equivalent  if  and  only 
if  P  <-►  Q  is  a  tautology,  this  establishes  the  theorem.  □ 
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While  it  is  very  important  to  be  able  to  restate  something  in  an  equivalent  form, 
it  is  equally  important  that  you  be  able  to  negate  a  statement.  Some  useful  negations 
appear  in  the  exercises  and  problems.  The  negation  of  an  implication  is  particularly 
important  in  mathematics.  If  you  think  about  integers  and  the  sentence  “If  x  is  prime, 
then  x  is  odd  or  x  =  2,”  you  can  see  that  even  a  relatively  simple  implication  might 
be  difficult  to  negate.  Let’s  begin  with  something  simpler. 

Exercise  2.8.  Construct  the  truth  table  for  P  — >  Q,  and  the  truth  table  for  ->P  V  Q. 
What  do  you  notice?  Now  construct  a  truth  table  for  (P  — ►  Q)  <->  (-P  V  Q).  What 
conclusion  can  you  make?  Finally,  find  an  equivalent  way  to  write  ->(P  — >  <2).  O 

If  all  went  well,  you  noticed  that  P  — >  Q  is  equivalent  to  ~P  V  Q,  and  therefore 
the  negation  of  “If  P,  then  Q”  is  “P  and  not  Q.”  Let’s  return  to 

“If  xis  prime,  then  x  is  odd  or x  2.” 
p  Q 


Negating  this  leads  to 


While  this  is  the  negation,  it  isn’t  really  as  helpful  as  it  might  be.  So  we  now  negate 
the  disjunction  “x  is  odd  or  x  =  2”  and  combine  it  with  our  previous  work  to  obtain 

“x  is  prime  and  x  is  not  odd  and  x  ^  2.” 


Refining  this  further,  we  would  probably  say  something  like  “x  is  prime,  even,  and 
not  equal  to  two.” 

The  negation  of  an  implication  is  something  you  should  learn  well  now  because 
it  arises  frequently.  In  the  theorem  below,  we  summarize  the  five  most  important 
equivalences  that  we  have  covered  so  far.  The  first  two  are  often  referred  to  as  De- 
Morgan’s  laws. 

Theorem  2.9.  Let  P  and  Q  denote  statement  forms.  The  following  are  tautologies: 

(DeMorgan’s  laws)  ->(PV(2)  <->  (-iP  A —Q)\ 
-OPA0)^(^PV-e); 

(Implication  and  (P  — >  Q)  «->  (->P  V  Q); 

its  negation)  —>(P — >  Q)  <->  (PA~iQ); 

(Double  negation)  —'(—P)  <->  P. 

Proof.  In  Exercise  2.6  we  showed  that  the  first  tautology  holds.  You  will  establish 
the  second  in  Problem  2.3.  The  third  and  fourth  tautologies  were  the  content  of  Ex¬ 
ercise  2.8.  Finally,  you  will  establish  the  last  tautology  when  you  work  Problem  2.2. 

□ 
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Here  are  some  examples  for  you  to  try. 

Exercise  2.10.  Negate  the  following.  It’s  interesting  to  note  that  you  can  negate  a 
statement  even  if  you  don’t  understand  what  it  says.  It  is  easier  to  get  it  right,  though, 
if  you  understand  the  statement. 

(a)  If  I  go  to  the  party,  then  he  is  there. 

(b)  If  x  is  even,  then  x  is  divisible  by  2. 

(c)  If  a  function  is  differentiable,  then  it  is  continuous. 

(d)  If  x  is  a  natural  number,  then  x  is  even  or  x  is  odd.  Q 


Exercise  2.11.  Which  of  the  following  are  equivalent  to  each  other?  All  the  answers 
have  appeared  in  this  chapter. 

P^Q,^(P\/Q),^(PAQ),PA^Q^(P^Q), 

PV^Q,^PV^Q,^PA^Q,^PVQ.  O 

So  let’s  apply  what  we  have  learned  in  this  chapter  to  Mr.  French,  who  eats  only 
pickles  on  Wednesday  and  only  chocolate  on  Monday.  One  statement  is  that  “if  it  is 
Wednesday,  then  Mr.  French  eats  only  pickles.”  We  let  W  represent  the  statement  “it 
is  Wednesday,”  and  P  the  statement  “Mr.  French  eats  only  pickles.”  Thus,  we  know 
that  W  — *  P  is  true.  (If  you  thought  we  should  have  said  W  AP  is  true,  note  that  we 
do  not  know  that  the  statement  W  is  true,  so  we  must  use  the  implication  here.)  The 
second  is  “if  it  is  Monday,  then  Mr.  French  eats  only  chocolate.”  Letting  M  denote 
“it  is  Monday”  and  C  the  statement  that  “Mr.  French  eats  only  chocolate”  we  may 
write  what  we  are  given  as  M  — >  C.  Finally  we  are  told  that  “Mr.  French  is  eating 
chocolate.”  From  this  we  can  conclude  that  —P  is  true.  Let’s  put  this  together. 

1.  W-^P, 

2.  M  — >  C,  and 

3.  -~P. 

Now,  it’s  fairly  clear  that  the  second  statement  is  irrelevant.  So  let  us  look  at  the 
truth  tables  for  the  first  and  third  statements  (for  convenience,  we  combine  the  two 
tables): 


w 

P 

W^P 

^P 

T 

T 

T 

F 

T 

F 

F 

T 

F 

T 

T 

F 

F 

F 

T 

T 

We  know  that  both  W  — >  P  and  —P  are  true,  and  from  our  truth  table  we  see  that 
there  is  only  one  time  that  this  happens:  when  both  W  and  P  are  false.  So  there  you 
have  it.  All  we  can  conclude  is  that  it  is  not  Wednesday. 

People  differ  in  their  approaches  to  problems.  In  the  example  above,  you  might 
have  found  it  easier  not  to  rewrite  the  problem.  That’s  fine.  On  the  other  hand,  when 
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a  problem  starts  to  confuse  you,  looking  at  it  as  we  have  here  will  often  help  you 
figure  out  how  to  attack  a  problem. 


Solutions  to  Exercises 


Solution  (2.1).  All  sentences  are  statements  except  (d)  and  (g). 

Part  (d)  is  not  a  statement  because  its  truth  depends  on  X,  and  X  is  a  variable.  So 
the  sentence  is  sometimes  true  and  sometimes  false. 

Part  (g)  is  tricky.  Suppose  it  were  a  statement.  Then  it  would  have  to  be  true  or 
false,  but  not  both.  Suppose  “This  sentence  is  false”  were  true.  Then  it  would  have  to 
be  false,  but  it  cannot  be  both  true  and  false.  From  this  we  conclude  that  the  sentence 
has  to  be  false.  But  reading  the  sentence  tells  us  that  if  it  is  false,  it  must  again  be 
true  as  well.  We  conclude  that  it  cannot  be  a  statement,  because  we  cannot  assign  a 
unique  truth  value  to  it. 


Solution  (2.2).  The  truth  table  for  P  V  Q  is 


p 

Q 

PVQ 

T 

T 

T 

T 

F 

T 

F 

T 

T 

F 

F 

F 

Solution  (2.3).  The  truth  table  for  P  A  Q  is 


P 

Q 

PAQ 

T 

T 

T 

T 

F 

F 

F 

T 

F 

F 

F 

F 

Solution  (2.4).  The  truth  table  for  P  — >  Q  is 


P 

Y 

T 

F 

F 


Q 

T 

F 

T 

F 


P 


T 

F 

T 

T 


Q 


This  is  the  same  as  the  truth  table  for  -P  V  Q. 


Solution  (2.5).  We  will  break  the  problem  into  parts  in  such  a  way  that  at  each  step 
we  apply  only  one  additional  connective  (except  for  the  negation,  which  is  handled 
easily  in  general). 
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Solution  (2.6).  We  note  that  the  last  statement  form  is  always  true.  (Note  that  the 
first  and  second  statement  forms  have  the  same  truth  table.) 

Solution  (2.8).  In  the  solution  to  Exercise  2.4,  we  noted  that  P  — >■  Q  and  ~P  /  Q  are 
equivalent.  Thus  —■{P  — ►  Q)  is  equivalent  to  -i(-iP  V  Q),  which  is,  as  we  have  seen 
in  Exercise  2.6,  equivalent  to  P  A~><2-  In  words,  the  negation  of  “If  P,  then  Q”  is 
“P  and  not  Q.” 

Solution  (2.10).  More  than  one  answer  is  possible  but  they  must  be  equivalent,  of 
course. 

(a)  I  go  to  the  party  and  he  is  not  there. 

(b)  One  answer  is:  x  is  even  and  x  is  not  divisible  by  2. 

(c)  A  function  is  differentiable  and  it  is  not  continuous. 

(d)  One  answer  is:  x  is  a  natural  number  and  x  is  not  even  and  x  is  not  odd. 
Equivalently,  we  could  say:  x  is  a  natural  number,  and  x  is  neither  even  nor 
odd. 


Problems 


Problem  2.1.  In  the  following  implications,  identify  the  antecedent  and  the  conclu¬ 
sion.  (Don’t  worry  about  whether  the  implication  is  true  or  false.) 

(a)  If  it  is  raining,  I  will  stay  home. 

(b)  I  wake  up  if  the  baby  cries. 

(c)  I  wake  up  only  if  the  fire  alarm  goes  off. 

(d)  If  x  is  odd,  then  x  is  prime. 

(e)  The  number  x  is  prime  only  if  x  is  odd. 

(f)  You  can  come  to  the  party  only  if  you  have  an  invitation. 

(g)  Whenever  the  bell  rings,  I  leave  the  house. 

Problem#  2.2.  Construct  a  truth  table  for  -i(-iP).  Is  this  what  you  expect?  Why? 

Problem*  2.3.  Write  out  the  truth  tables  for  ->(P  A  (?)  and  -iP  V  —Q.  What  can  you 
conclude? 
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Problem#  2.4.  Find  a  statement  form,  S,  equivalent  to  -i(P  V  Q)  and  show  that  it  is 
logically  equivalent  by  constructing  the  truth  table  for  “S  if  and  only  if  —  ( P  V  QT 
and  showing  that  this  statement  form  is  a  tautology. 

Problem  2.5.  Write  out  the  truth  table  for  the  statement  form  P  — »  —(Q  A  —A).  Is 
this  statement  form  a  tautology,  a  contradiction,  or  neither? 

Problem  2.6.  Write  out  the  truth  table  for  the  statement  form  (P  — >  (— >/?  V  Q)  )  A  R. 
Is  this  statement  form  a  tautology,  a  contradiction,  or  neither? 

Problem  2.7.  Negate  the  sentences  below  and  express  the  answer  in  a  sentence  that 
is  as  simple  as  possible. 

(a)  I  will  do  my  homework  and  I  will  pass  this  class. 

(b)  Seven  is  an  integer  and  seven  is  even. 

(c)  If  T  is  continuous,  then  T  is  bounded. 

(d)  I  can  eat  dinner  or  go  to  the  show. 

(e)  If  x  is  odd,  then  x  is  prime. 

(f)  The  number  x  is  prime  only  if  x  is  odd. 

Problem  2.8.  Negate  the  following. 

(a)  If  I  am  not  home,  then  Sam  will  answer  the  phone  and  he  will  tell  you  how  to 
reach  me. 

(b)  If  the  stars  are  green  or  the  white  horse  is  shining,  then  the  world  is  eleven 
feet  wide. 

(c)  If  we  go  swimming  or  bowling,  then  dinner  will  be  late  or  Bob  will  bring 
veggie  burgers. 

Problem  2.9.  Consider  the  statement  form  (P  A  ~'Q)  -»■  R- 

(a)  Find  the  truth  table  for  this  statement  form. 

(b)  Construct  a  different  statement  form  using  P,  Q,  and  R  such  that  if  you  call 
your  construction  S,  then  ((PA  -'Q)  ->• R)  <->  S  is  a  tautology. 

Problem  2.10.  Consider  the  statement  form  (P  V  — (?)  — >  ( RRQ ). 

(a)  Write  out  the  truth  table  for  this  form. 

(b)  Give  a  statement  in  English  that  is  in  this  form. 

(c)  Write  the  negation  of  your  English  statement,  and  simplify  the  sentence  as 
much  as  possible. 

Problem  2.11.  For  each  of  the  cases  below,  write  a  tautology  using  the  given  state¬ 
ment  form. 

For  example,  if  you  are  given  P  V  —<Q,  you  might  write  (P  V  --fi)^(G-P). 

(a)  — •(— >P); 

(b)  n(PVfi); 

(c)  -(PAg); 
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(d)  P^Q. 

Problem  2.12.  When  we  write,  we  should  make  certain  that  we  say  what  we  mean. 
If  we  write  P  /\QV  R,  you  may  be  confused,  since  we  haven’t  said  what  to  do  when 
you  are  given  a  conjunction  followed  by  a  disjunction.  Put  parentheses  in  to  create 
a  statement  form  with  the  given  truth  table. 


p 

Q 

R 

PAQVR 

T 

T 

T 

T 

T 

T 

F 

T 

T 

F 

T 

T 

T 

F 

F 

F 

F 

T 

T 

T 

F 

T 

F 

F 

F 

F 

T 

T 

F 

F 

F 

F 

Problem  2.13.  For  each  of  the  cases  below,  write  a  contradiction  using  the  given 
statement  form. 

For  example,  if  you  are  given  -i(-iP),  you  might  write  — ■(— P)  <->  — P. 

(a)  P^g; 

(b)  n(PVfi); 

(c)  ^PV^Q; 

(d)  P^Q. 

Problem  2.14.  Consider  the  following  statement:  If  /  is  not  continuous  at  1  and  —  1 , 
then  the  group  of  invariants  is  an  infinite  cyclic  group,  a  cyclic  group  of  order  2,  or 
the  trivial  group. 

You  probably  do  not  know  what  the  words  in  italics  mean,  but  you  don’t  need 
to  know  in  order  to  work  this  problem.  Just  think  of  them  as  describing  different 
objects.  This  is  an  exercise  in  restating  things  you  don’t  understand — something 
that  might  be  useful  in  the  future ! 

(a)  Write  the  form  of  this  statement  using  P,  Q ,  R ,  S,  and  T.  (It’s  possible  to  use 
fewer  variables  and  still  have  a  correct  solution.)  Say  precisely  what  each  of 
your  letters  represent. 

(b)  Write  the  negation  of  this  statement  in  words.  Use  a  phrase  that  is  as  simple 
and  direct  as  possible. 

Problem  2.15.  Consider  the  statement  “It  snows  or  it  is  not  sunny.” 

(a)  Find  a  different  statement  that  is  equivalent  to  the  given  one. 

(b)  Find  a  different  statement  that  is  equivalent  to  the  negation  of  the  given  one. 

Problem  2.16.  The  following  problem  is  well  known.  Many  different  versions  of 
this  problem  appear  in  [101]. 

On  a  certain  island,  each  inhabitant  is  a  truth-teller  or  a  liar  (and  not  both,  of 
course).  A  truth-teller  always  tells  the  truth  and  a  liar  always  lies.  Arnie  and  Barnie 
live  on  the  island. 
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(a)  Suppose  Arnie  says,  “If  I  am  a  truth-teller,  then  each  person  living  on  this 
island  is  a  truth-teller  or  a  liar.”  Can  you  say  whether  Arnie  is  a  truth-teller  or 
liar?  If  so,  which  one  is  he? 

(b)  Suppose  that  Arnie  had  said,  “If  I  am  a  truth-teller,  then  so  is  Barnie.”  Can 
you  tell  what  Arnie  and  Barnie  are?  If  so,  what  are  they? 

Problem  2.17.  Write  a  truth  table  for  (PA  (P  — >  Q))  —■ >  Q.  What  can  you  conclude? 

Problem  2.18.  Police  at  Small  Unnamed  University  have  received  a  report  that  a 
student  was  skateboarding  in  the  hall.  They  rush  to  the  scene  of  the  crime  to  deter¬ 
mine  who  the  guilty  party  is,  and  they  are  met  by  three  students:  Alan,  Bernard,  and 
Charlotte.  When  questioned,  Alan  says,  “If  Bernard  did  not  do  it,  then  it  was  Char¬ 
lotte.”  Bernard  says,  “Alan  and  Charlotte  did  it  together  or  Charlotte  did  it  alone,” 
and  Charlotte  says,  “We  all  did  it  together.” 

(a)  If  the  police  know  that  exactly  one  person  committed  the  crime,  and  exactly 
one  person  is  lying,  who  is  the  guilty  party? 

(b)  As  it  turns  out,  exactly  one  person  committed  the  crime  and  all  the  students 
are  lying.  Who  is  the  guilty  party? 

Problem  2.19.  Show  that  if  two  statements,  P  and  Q,  are  equivalent,  then  their  nega¬ 
tions,  — P  and  -<Q,  are  also  equivalent. 

Problem  2.20.  We  know  that  each  of  the  three  statements  below  is  correct.  What 
can  we  conclude?  Why? 

1.  If  he  was  killed  before  noon,  then  his  body  temperature  is  at  most  20°C. 

2.  His  body  temperature  is  at  most  20°C  and  the  police  know  who  murdered  him. 

3.  If  the  police  know  who  murdered  him,  then  he  was  killed  before  noon. 

Problem  2.21.  We  have  been  avoiding  the  use  of  “either. .  .or”  in  this  text  because 
the  English  language  uses  this  construction  ambiguously  and  the  interpretation  often 
depends  on  the  context.  Consider  the  following  two  statements: 

1.  “Either  Peter  or  Paul  ran  in  the  race.” 

2.  “You’re  either  with  us  or  against  us.” 

(a)  Write  the  (intended)  statement  form  for  each  of  the  two  statements. 

(b)  Give  two  more  examples  of  the  use  of  “either  A  or  B”  in  the  English  language 
such  that  in  the  first  of  your  examples  both  A  and  B  occur,  and  in  the  second 
of  your  examples  exactly  one  of  A  and  B  can  occur. 

(c)  The  symbol  V  is  sometimes  used  for  the  “exclusive  disjunction”;  that  is,  a 
connective  that  yields  a  tme  statement  if  and  only  if  exactly  one  of  the  two 
statements  is  true.  Give  an  equivalent  statement  form  of  P'/Q  using  only  the 
connectives  introduced  earlier  in  this  text. 

(d)  The  negation  “neither. .  .nor”  is  interpreted  exclusively  as  the  negation  of  the 
regular  disjunction  in  the  English  language.  Give  the  tmth  table  for  the  state¬ 
ment  form  “neither  P  nor  Q"  and  give  a  meaningful  English  statement  that  is 
in  this  form. 


Chapter  3 

Introducing  the  Contrapositive  and  Converse 


In  the  last  chapter  (see  Theorem  2.7)  we  saw  that  two  statement  forms,  P  and  Q,  that 
have  the  same  truth  table  are  equivalent.  This  was  also  expressed  by  showing  that  the 
equivalence,  P  <->  Q,  is  a  tautology.  When  you  are  confronted  with  a  mathematical 
statement  that  you  need  to  prove,  you  will  often  find  it  helpful  to  paraphrase  it.  You 
will  use  tautologies  to  do  so,  since  you  don’t  want  to  change  the  truth  value  of  your 
statement.  Some  useful  tautologies  appeared  in  Theorem  2.9.  More  appear  below 
and  throughout  this  chapter. 

Theorem  3.1.  Let  P.  Q,  and  R  denote  statement  forms.  Then  the  following  are  tau¬ 
tologies: 


(Distributive  property ) 

(P  A 

{Q  V  R))  <->  ((P  A 

Q) 

V  (PAP)); 

(PV 

(QAR))  <->  ((PV 

Q) 

A  (PVP)j; 

(Associative  property ) 

(PA 

(GAP))  <->  ((P A 

Q) 

AP); 

(PV 

(0vp))<-((pv 

Q) 

VP); 

(Commutative  property) 

(PA 

G)^(GAP); 

(PV 

0)<-(0VP). 

At  this  point,  you  should  be  able  to  construct  the  truth  tables  for  everything  above 
and  you  should  be  able  to  show  that  all  of  them  are  tautologies. 

Exercise  3.2.  Negate  the  following: 

(a)  (PA0)V(PAP); 

(b)  P^(QRR).  O 


Tautologies  allow  us  to  replace  one  statement  by  another.  For  example,  suppose 
you  want  to  show  that  an  integer  is  odd  or  prime.  You  can  show  that  the  integer  is 
prime  or  odd;  that  won’t  change  things  because  these  two  statements  are  equivalent. 
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This  is  a  fairly  obvious  change  that  usually  won’t  make  much  of  a  difference.  The 
same  holds  if  you  want  to  show  x  is  prime  and  odd;  you  can  show  that  it  is  odd  and 
prime  if  that’s  easier  and  you  will  have  accomplished  the  same  thing.  Similarly,  if 
you  want  to  show  that  it  is  not  the  case  that  x  is  prime  and  odd,  you  can  show  that  x 
is  not  prime  or  not  odd. 

For  implications,  restating  what  you  want  to  prove  can  really  make  a  difference. 
We  need  to  make  sure,  however,  that  what  we  have  is  equivalent  to  our  original 
statement.  So  recall  that  we  showed,  in  the  last  chapter,  that  P  — >  Q  is  equivalent  to 
-FVg. 

Now  consider  —Q  — >  ->P,  which  is  called  the  contrapositive  of  the  implication 
P  — ►  Q.  We  need  to  compare  the  two  truth  tables  below: 


P 

Y 

T 

F 

F 


Q 

T 

F 

T 

F 


P 


T 

F 

T 

T 


Q 


T 

F 

T 

T 


^P 


So  the  fact  that  the  truth  tables  are  the  same  and  Theorem  2.7  tell  us  that  the 
statement  forms  are  logically  equivalent.  What  this  means  to  us  is  that,  if  we  are 
trying  to  prove  that  an  implication  is  true  and  we  don’t  see  how  to  do  it,  we  should 
consider  the  contrapositive  of  that  statement.  Here’s  how  it  works  in  practice. 


Theorem  3.3.  Let  x  be  an  integer.  If  x2  is  odd,  then  x  is  odd. 


First  we  need  to  understand  the  problem.  What  does  it  mean  for  a  number  x  to  be 
odd?  Our  definition  of  “odd  number”  is  the  following:  An  integer  x  is  odd  if  there 
is  an  integer  n  such  that  x  =  2n+  1.  So  we  are  assuming  that  x2  =  2n+l  for  some 
integer  n  and  trying  to  show  x  =  2m  +  1  for  some  integer  m.  It’s  hard  to  see  where 
to  go  from  here,  we  think. 

Remember  that  Polya  suggests  restating  the  problem,  so  let’s  try  that.  Let  P  be 
the  sentence  “x2  is  odd”  and  Q  be  the  sentence  “x  is  odd.”  Then  we  see  that  we  wish 
to  prove  that  P  —>  Q  is  true.  But  this  is  logically  equivalent  to  ~Q  — >  -i P,  which 
translates  into  “If  x  is  not  odd,  then  x2  is  not  odd.”  We  can  do  better  than  that,  since 
an  integer  is  odd  or  even.  So  we  can  show  that  “If  x  is  even,  then  x2  is  even”  and  that 
will  be  equivalent.  Let’s  see  if  that’s  easier. 

Theorem  (Contrapositive  of  the  statement  of  Theorem  3.3).  Let  x  be  an  integer.  If 
x  is  even,  then  x2  is  even. 


The  first  step  is  to  understand  the  problem.  The  second  step  is  to  prove  it.  We’ll 
do  that  here: 

“Understanding  the  problem.”  When  is  an  integer  even?  Our  definition  of  “even 
number”  is  the  following:  An  integer  x  is  even  if  there  is  an  integer  n  such  that 
x  =  2 n.  So  we  need  to  show  that  x2  =  2m,  where  m  is  an  integer,  assuming  that 
x  =  2 n,  where  n  is  an  integer.  We  began  by  understanding  the  problem,  now  we  are 
ready  to  solve  it. 
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Proof.  Let  x  be  even.  Then  there  is  an  integer  n  such  that  x  =  2 n.  Therefore,  x2 
=  (2 n)2  =  2(2 n2).  Let  m  =  2 n2.  Then  x2  =  2m  and  m  is  an  integer.  Therefore  x2  is 
even.  □ 

Of  course,  this  takes  care  of  the  original  theorem,  since  it  is  equivalent  to  the  one 
we  proved.  Thus,  using  the  contrapositive  is  one  possible  way  to  attempt  to  prove 
that  an  implication  is  true.  We  will  soon  have  a  number  of  ways  to  attack  a  problem. 
Try  to  keep  them  all  in  mind. 

Some  other  related  remarks:  Notation  is  more  important  than  it  may  seem.  In  the 
theorem  above,  we  assume  that  x  is  even  and  try  to  show  x2  is  even.  If  we  assume 
that  x  =  2n  and  accidentally  try  to  show  x2  =  2 n  (rather  than  x2  =  2m),  we’re  stuck 
because  we  assumed  erroneously  that  x  =  x2.  In  other  words,  our  notation  would 
force  us  to  show  that  x  =  0  or  x  =  1,  which  is  not  what  we  should  be  doing.  We 
introduced  an  error  because  of  poor  notation.  So  it’s  important  that  one  symbol  be 
an  n  and  one  be  an  m. 

Also,  note  that  we  begin  the  proof  by  saying  what  we  are  assuming,  and  end  the 
proof  by  saying  what  we  are  concluding.  That  helps  the  reader  too.  Finally,  we  keep 
checking  that  m  and  n  are  integers.  That’s  because  that  is  very  important;  if  they 
weren’t  integers,  x  wouldn’t  have  to  be  even. 

So  the  contrapositive  was  very  helpful  here.  You  do  need  to  be  careful  though. 
It  must  be  the  contrapositive  and  not  the  converse.  The  converse  of  an  implication 
P  ^  2  is  the  statement  form  Q  — ►  P.  Looking  at  the  truth  tables  for  each  of  these 
given  below. 
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we  see  that  they  are  different.  Unfortunately,  though  the  contrapositive  and  converse 
of  a  statement  are  really  very  different,  students  often  confuse  them.  We’ll  take  just 
a  moment  to  convince  you  that  it  is  very  important  not  to  do  this. 

Suppose  our  statement  is,  “If  I  am  a  Hobbit,  then  I  am  under  5  feet  tall.”  This  is 
a  true  statement,  as  every  Tolkien  reader  knows.  The  converse  is  “If  I  am  under  5 
feet  tall,  then  I  am  a  Hobbit.”  This  latter  statement  is  not  true,  since  lots  of  children 
are  under  5  feet  tall,  but  most  of  them  are  not  Hobbits.  As  a  mathematical  example, 
consider  the  sentence  about  integers  “If  x  is  seven,  then  x  is  prime,”  and  its  converse 
“If  x  is  prime,  then  x  is  seven.”  Recall  that  an  integer  p  is  prime  if  p  >  1  and  p 
cannot  be  written  as  a  product  of  two  positive  integers,  both  different  from  p.  Thus, 
the  original  sentence  is  true  for  all  x,  while  the  converse  above  is  not.  On  the  other 
hand,  you  agree  that  for  all  x  the  contrapositive  “If  x  is  not  prime,  then  x  is  not 
seven,”  is  true,  as  it  must  be.  But  this  is  trickier  when  we  don’t  really  understand 
what  we  are  saying  as  well  as  we  understand  this  statement.  Remember,  make  sure 
you  understand  the  problem.  We  present  some  examples  and  exercises  for  you  to  try 
your  hand  at. 
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Example  3.4.  Consider  the  slightly  odd  sentence:  “If  the  sky  is  green,  then  2  +  2 
=  4.”  What  are  the  converse  and  the  contrapositive  of  this  implication?  Which  of 
the  following  (if  any)  is  true:  the  statement,  the  converse,  or  the  contrapositive? 

The  converse  is:  “If  2  +  2  =  4,  then  the  sky  is  green.”  The  contrapositive  is:  “If 
2  +  2  ^4  4,  then  the  sky  is  not  green.” 

To  decide  on  the  validity  of  the  statements  we  have  to  agree  on  the  truth  of  each 
part.  We  are  quite  confident  that  2  +  2  =  4,  so  this  part  is  true.  We  abbreviate  the 
statement  with  A  (for  arithmetic).  Since  we  have  never  seen  a  (natural)  green  sky 
before,  we  suggest  that  “the  sky  is  green,”  abbreviated  by  S,  should  be  considered 
as  false.  With  this  in  place,  we  look  at  the  relevant  parts  of  the  truth  tables. 


So  the  original  statement  and  its  contrapositive  are  true,  while  the  converse  is  false. 
Of  course  we  knew  beforehand  that  the  original  statement  and  the  contrapositive 
would  have  the  same  answer  because  they  are  equivalent  statements.  O 

Now  it’s  your  turn. 

Exercise  3.5.  Consider  the  sentence  “If  n  is  odd,  then  ir  -  n  —  6  is  even.” 

(a)  State  the  contrapositive. 

(b)  State  the  converse.  O 


We  should  also  mention  one  more  possibility  that  comes  up  frequently:  the  in¬ 
verse  of  an  implication  P  — >  Q  is  the  statement  form  -/J  — >  —Q.  As  you  will  show 
in  the  exercise  below,  this  form  is  nothing  new.  The  inverse  of  an  implication  can  be 
expressed  in  terms  of  our  earlier  definitions. 

Exercise  3.6.  Consider  the  implication  P  — >  Q. 

(a)  Write  the  truth  table  for  the  inverse  of  P  — >  Q. 

(b)  Express  the  inverse  of  an  implication  in  terms  of  the  converse  and  contrapos¬ 
itive  in  two  different  ways. 

(c)  State  the  relation  between  the  inverse  of  P  — *  Q  and  the  converse  of  P  — >  Q, 

and  give  a  reason  for  your  answer.  Q 


Definitions 

Definition  3.1.  An  integer  x  is  odd  if  there  is  an  integer  n  such  that  x  =  2n  +  1. 

Definition  3.2.  An  integer  x  is  even  if  there  is  an  integer  n  such  that  x  =  2 n. 

Definition  3.3.  An  integer  p  is  prime  if  p  >  1  and  p  cannot  be  written  as  a  product 
of  two  positive  integers,  both  different  from  p. 
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Solutions  to  Exercises 

Solution  (3.2).  The  equivalences  are  given  below. 

(a)  The  negation  may  be  stated  as  (-P  V  ~^Q)  A  (^P  V  ^P),  since 

-((PAG)  V  (PAP))  <-►  HPAQ)A-(PAP)) 

<-►  ((->P  V-iQ)  A  (-tPV-tR)). 

(b)  The  negation  may  be  stated  as  P  A  (—<2  V  ~>R),  since 

-(P-+(GAP))  «-►  (P  A  -■(GAP)) 

«->  (PA(-.gV-.P)). 


Solution  (3.5). 

(a)  The  contrapositive  is  “If  rr  —  n  —  6  is  odd,  then  n  is  even.” 

(b)  The  converse  is  “If  rr  —n  —  6  is  even,  then  n  is  odd.” 


Solution  (3.6). 


(a) 


T 
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-Q 


(b)  The  inverse  of  an  implication  is  the  converse  of  the  contrapositive  of  the  im¬ 
plication  and  this,  in  turn,  is  equivalent  to  the  contrapositive  of  the  converse 
of  the  implication. 

(c)  The  inverse  of  the  implication  P  — ■»  Q  is  equivalent  to  the  converse  of  the 
implication  P  — >■  Q.  To  see  this,  compare  the  truth  table  of  the  inverse  given 
in  (a)  with  the  truth  table  of  the  converse  from  page  27.  They  are  the  same 
and  thus,  by  Theorem  2.7,  the  inverse  is  equivalent  to  the  converse. 


Problems 

Problem  3.1.  (a)  Write  a  tautology  involving  only  logical  symbols,  the  implica¬ 

tion  P  — >  Q,  its  converse,  and  P  <->  Q. 

(b)  Can  you  write  a  tautology  involving  only  P  — >  Q  and  its  contrapositive?  If  so, 
how?  If  not,  why  not? 

Problem*  3.2.  (a)  Let  x  be  an  integer.  Prove  that  if  x  is  odd,  then  x 2  is  odd.  Make 

sure  you  state  your  assumption  as  the  first  line  and  your  conclusion  as  the  last 
line. 
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(b)  State  the  contrapositive  of  what  you  just  proved. 

(c)  Combining  the  result  of  part  (a)  with  Theorem  3.3  gives  a  stronger  result.  Say 
precisely  what  that  result  is. 

Problem  3.3.  For  each  of  the  following,  write  out  the  contrapositive  and  the  con¬ 
verse  of  the  sentence. 

(a)  If  you  are  the  President  of  the  United  States,  then  you  live  in  a  white  house. 

(b)  If  you  are  going  to  bake  a  souffle,  then  you  need  eggs. 

(c)  If  x  is  a  real  number,  then  x  is  an  integer. 

(d)  If  x  is  a  real  number,  then  x2  <  0. 

Problem  3.4.  State  the  contrapositive  of  each  of  the  following. 

(a)  If  it  rains,  then  it  pours. 

(b)  If  I  had  a  bell,  I  would  ring  the  bell  in  the  morning. 

(c)  The  house  is  red,  if  the  house  is  not  blue. 

(d)  Dinner  is  cooked  only  if  I  make  it. 

Problem  3.5.  State  the  converse  of  each  of  the  following. 

(a)  If  it  rains,  then  it  pours. 

(b)  If  I  am  young,  then  I  am  restless. 

(c)  I  am  alone,  if  it  is  Saturday. 

(d)  I  eat  fish  only  if  it  is  cooked. 

Problem  3.6.  State  the  inverse  of  each  of  the  following. 

(a)  If  it  rains,  then  it  pours. 

(b)  If  I  am  living  abroad,  then  I  need  brownies. 

(c)  To  run  quickly,  it  is  sufficient  to  have  long  legs. 

(d)  To  make  good  chocolate  chip  cookies,  it  is  necessary  to  have  baking  soda. 

Problem  3.7.  Consider  the  statement  form  P  — >  Q. 

(a)  Write  the  negation  of  the  converse  of  this  statement  form  in  as  simple  a  form 
as  possible. 

(b)  Write  the  negation  of  the  contrapositive  of  this  statement  form  in  as  simple  a 
form  as  possible. 

(c)  Write  the  negation  of  the  inverse  of  this  statement  form  in  as  simple  a  form 
as  possible. 

Problem  3.8.  Consider  the  sentence:  “The  horses  eat  the  grass  only  if  they  are  led 
to  the  pasture.” 

(a)  Write  the  negation  of  the  converse  of  this  sentence.  Your  answer  has  to  be 
simple  and  as  eloquent  as  possible. 

(b)  Is  the  sentence  in  (a)  the  same  as  the  converse  of  the  negation  of  the  original 
sentence?  Explain  your  answer. 
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Problem  3.9.  Let  x  and  y  be  real  numbers.  Show  that  if  x  /  v,  then  2x  +  4  /  2v  +  4. 
(Hint:  Use  the  contrapositive.) 

Problem  3.10.  Matilda  always  eats  at  least  one  of  the  following  for  breakfast:  ce¬ 
real,  bread,  or  yogurt.  On  Monday,  she  is  especially  picky. 

If  she  eats  cereal  and  bread,  she  also  eats  yogurt.  If  she  eats  bread  or  yogurt,  she 
also  eats  cereal.  She  never  eats  both  cereal  and  yogurt.  She  always  eats  bread  or 
cereal. 

Can  you  say  what  Matilda  eats  on  Monday?  If  so,  what  does  she  eat? 

Problem  3.11.  Consider  the  following  statement. 

If  the  coat  is  green,  then  the  moon  is  full  or  the  cow  jumps  over  it. 

(a)  This  odd  statement  is  composed  of  several  substatements.  Identify  each  sub¬ 
statement,  assign  a  letter  to  it,  and  write  down  the  original  statement  as  a 
statement  form  using  these  letters  and  logical  connectives. 

(b)  Find  the  contrapositive  of  the  original  statement  form  from  part  (a).  Use  this 
to  write  the  contrapositive  of  the  original  statement  as  an  English  sentence. 

(c)  Find  the  converse  of  the  original  statement  form  from  part  (a).  Use  this  to 
write  the  converse  of  the  original  statement  as  an  English  sentence. 

(d)  Find  the  negation  of  the  original  statement  form  from  part  (a).  Use  this  to 
write  the  negation  of  the  original  statement  as  an  English  sentence. 

(e)  Are  some  of  the  statements  in  this  problem  (the  original  or  the  ones  you  ob¬ 
tained)  equivalent?  If  so,  which  ones? 

Problem  3.12.  Consider  the  two  statement  forms  P  — >  Q  and  P  — >  (Q  V  ~<P). 

(a)  Make  a  truth  table  for  each  of  these  statement  forms. 

(b)  What  can  you  conclude  from  your  solution  to  part  (a)? 

Problem  3.13.  Karl’s  favorite  brownie  recipe  uses  semisweet  chocolate,  very  little 
flour,  and  less  than  1/4  cup  sugar.  He  has  four  recipes:  one  French,  one  Swiss, 
one  German,  and  one  American.  Each  of  the  four  has  at  least  two  of  the  qualities 
Karl  wants  in  a  brownie  recipe.  Exactly  three  use  very  little  flour,  exactly  three  use 
semisweet  chocolate,  and  exactly  three  use  less  than  1/4  cup  sugar. 

The  Swiss  and  the  German  recipes  use  different  kinds  of  chocolate.  The  Amer¬ 
ican  and  the  German  recipes  use  the  same  amount  of  flour,  but  different  kinds  of 
chocolate.  The  French  and  the  American  recipes  use  the  same  amount  of  flour.  The 
German  and  American  recipes  do  not  both  use  less  than  1  /4  cup  sugar. 

Karl  is  very  excited  because  one  of  these  is  his  favorite  recipe.  Which  one  is  it? 

Problem  3.14.  Let  n  be  an  integer.  Prove  that  if  3 n  is  odd,  then  n  is  odd. 

Problem  3.15.  Let  x  be  a  natural  number.  Prove  that  if  x  is  odd,  then  \/2x  is  not  an 
integer. 

Problem  3.16.  Let  x  and  y  be  real  numbers.  Show  that  if  x  ^  y  and  x,y>  0,  then 
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Problem  3.17.  In  the  statement  below,  G  is  a  group  and  H  is  a  normal  subgroup  of 
G.  (You  need  not  know  what  “group,”  “normal  subgroup,”  or  “p-group”  mean  to  do 
this  problem!) 

“If  H  and  G/H  are  p-groups,  then  G  is  a  p-group.” 

(a)  State  the  converse  of  this  statement. 

(b)  State  the  contrapositive  of  this  statement. 

(c)  Consider  the  following:  “G  is  a  p-group  if  and  only  if  H  and  G/H  are  p- 
groups.”  Write  this  in  terms  of  your  answers  to  the  first  two  parts  of  this 
problem. 

Problem  3.18.  Prove  that  if  the  product  of  two  integers  x  and  y  is  odd,  then  both 
integers  are  odd.  Describe  your  method  of  proof. 

Problem  3.19.  Consider  the  statement  “If  Simon  takes  German  or  French,  then  he 
cannot  take  Russian.” 

(a)  State  the  contrapositive  of  this  implication. 

(b)  State  the  converse  of  this  implication. 

For  parts  (c)  and  (d),  assume  the  original  statement  is  true. 

(c)  Suppose  someone  tells  you  that  Simon  did  not  take  German.  What,  if  any¬ 
thing,  can  you  conclude  about  Simon?  Why? 

(d)  Suppose  someone  tells  you  that  Simon  took  French.  What,  if  anything,  can 
you  conclude  about  Simon?  Why? 

Problem  3.20.  Consider  the  statement  form  (P  V  ~^Q)  — >  ( Rf\Q ). 

(a)  Write  out  the  truth  table  for  this  form. 

(b)  Make  up  a  meaningful  English  statement  that  has  this  form. 

(c)  Write  the  contrapositive  of  your  English  statement.  Simplify  the  sentence  as 
much  as  you  can. 


Chapter  4 

Set  Notation  and  Quantifiers 


Consider  the  sentence  “The  equation  x1  +  2x  =  15  has  a  unique  solution.”  Thus  far, 
we’ve  approached  such  things  intuitively.  It’s  time  now  to  tackle  this  head  on.  Is  it 
true?  False?  It  depends  on  which  a  we  have  in  mind,  of  course.  We  turn  to  a  rigorous 
way  to  make  our  sentence  x2  +  2x  =  15  into  a  statement.  But  before  we  get  to  the 
heart  of  this  chapter,  it  will  be  useful  to  have  notation  for  the  things  with  which  we 
frequently  work. 

We  will  need  to  understand  the  possible  set  of  values  that  the  variable  x  can 
assume.  Now  we  will  follow  the  point  of  view  that  many  mathematicians  follow: 
while  we  think  of  a  set  as  a  collection  of  objects,  we  will  define  neither  set  nor 
object.  What  we  will  do  instead  is  to  say,  carefully  and  precisely,  how  these  two 
words  can  be  used  and  we  do  so  with  axiomatic  statements.  (The  system  most  people 
use  now,  the  Zermelo-Fraenkel  system  together  with  the  axiom  of  choice,  is  stated 
in  the  Appendix  for  reference.  We  will  say  much  more  about  sets  in  Chapter  6 
and  in  subsequent  chapters.)  At  this  point  we  will  concentrate  on  understanding  the 
notation  and  commonly  used  symbols. 

We  will  write  x  £  A  to  indicate  that  x  is  an  element  of  A.  (Some  people  read 
x  £  X  as  “x  belongs  to  A,”  others  read  it  “x  is  an  element  of  A.”)  Usually  we  will 
be  considering  things  of  a  particular  type.  The  set  of  all  possible  objects  that  are 
considered  in  the  context  in  which  we  work  is  called  the  universe,  which  is  also 
sometimes  called  the  domain  of  discourse.  We  will  usually  denote  the  universe  by 
A.  In  some  cases  the  universe  may  consist  of  all  real  numbers,  or  it  may  consist  of 
all  right  triangles;  it  might  even  consist  of  all  cows  living  in  France.  The  set  may 
consist  of  all  positive  real  numbers,  all  isosceles  right  triangles,  or  all  white  cows 
living  in  France.  And  the  elements  might  be  the  real  number  n,  the  isosceles  right 
triangle  with  legs  of  length  1,  or  Farmer  Boursin’s  white  cow  Elsie,  which  lives 
in  Dijon,  France,  and  produces  a  mighty  fine  cheese.  Some  people  even  allow  the 
universe  to  be  the  “set  of  all  sets,”  even  though  this  universe  is  no  set  at  all  (see  the 
Spotlight:  Paradoxes  on  page  67). 

When  it  is  clear,  implicitly,  what  the  universe  is,  we  may  not  mention  it  explicitly. 
But  when  there  is  any  doubt  at  all,  we  will  carefully  state  what  the  universe  is.  Once 
we  do  that,  we  can  denote  a  set  by  writing  S  =  {x  £  X  :  x  satisfies  P\.  The  brackets 
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indicate  that  we  are  talking  about  a  set  of  objects,  called  elements;  x  G  X  tells  us 
where  these  elements  live,  and  P  is  a  property  these  elements  have. 

In  this  class,  as  well  as  others,  some  sets  show  up  a  lot  and  we  have  special 
notation  for  them.  Notation  should  always  be  chosen  carefully,  as  these  have  been. 
Most  mathematicians  agree  on  these,  so  don’t  make  up  your  own  notation  and  make 
sure  you  recognize  what  these  are  when  they  are  used: 

The  natural  numbers  N  =  {0, 1,2,3, . . .}. 

The  integers  Z  =  {. . . 2,  — 1,0, 1,2, . . .}. 

The  rational  numbers  Q  =  {p/q  :  p,q  &  Z  and  q  ^  0}. 

The  real  numbers  R. 

The  complex  numbers  C  =  {a  +  bi :  i2  =  —  1  and  a,b  G  R}. 

If  A  is  one  of  the  sets  Z,Q,  or  R,  then  the  set  of  the  positive  elements  is  denoted 

by  A+  =  {x  G  A  :  x  >  0}  and  the  set  of  the  negative  elements  is  denoted  by 

A~  =  {x  G  A  :  x  <  0}.  Thus  we  have  defined  Z+,Z~,Q+,(Q)_,R+,  and  R~. 

The  plane  R2  =  {(x,y)  :  x,y  G  R}. 

For  n  G  Z+,  Euclidean  n-space  R"  =  {(xi  ,X2,  ■ . .  ,xn)  :  xj  G  R  for  j  =  1,2, . . . ,«}. 

Some  authors  include  zero  in  N  and  others  don’t.  If  you  look  in  another  text, 
make  sure  you  know  what  convention  they  follow. 

For  real  numbers  a  and  b  with  a  <  b,  the  set  \a,b\  =  {x  G  R  :  a  <  x  <  b}  is 
called  the  closed  interval  from  a  to  b.  The  sets  [a,°o)  =  {x  G  R  :  a  <  x}  and  (— °°,b] 
=  {x  G  R  :  x  <  b)  are  called  unbounded  closed  intervals.  For  a  <  b,  the  set  (a,  b) 
=  {x  G  R  :  a  <  x  <  b}  is  called  the  open  interval  from  a  to  b.  We  shall  see  that  ( a7b ) 
can  be  interpreted  several  ways,  and  you  should  be  able  to  decide  which  from  the 
way  it  is  used.  You’ve  done  this  in  the  past.  For  example,  you  have  certainly  had 
courses  where  (x,y)  denotes  a  point  and,  in  the  same  course,  (x,y)  might  denote  an 
open  interval.  Unbounded  open  intervals  are  defined,  with  appropriate  changes,  in 
the  same  way  that  we  defined  unbounded  closed  intervals  above. 

Exercise  4.1.  Find  a  (different)  useful  way  to  describe  the  following  sets  (your  use¬ 
ful  way  could  be  a  sketch): 

(a)  {,v  G  Z  :  x2  =  1}; 

(b)  {x  G  N  :  x2  =  1}: 

(c)  {(x,y)  G  R2  :  y  =  0}; 

(d)  {(x,y,z)  G  R3 :  z  =  0}; 

(e)  {x  G  Z  :  x  is  even}; 

(f)  {(m,n)  :  m,n  G  Z}.  O 

Now  we  can  talk  about  slightly  more  complicated  sentences.  Think  of  the  dif¬ 
ference  between  the  statements  “In  every  box  there  is  a  prize”  and  “In  some  box 
there  is  a  prize.”  Obviously,  if  you  had  to  choose  (and  if  it  were  the  same  prize), 
you  would  go  with  the  first  one.  In  mathematics,  in  order  to  determine  the  truth  or 
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falsity  of  a  statement,  we  need  to  know  whether  we  are  talking  about  a  particular  x 
or  all  x.  What  we  mean  should  be  clear  from  the  context.  Letters  like  x  that  stand  for 
elements  of  the  universe  are  called  variables.  The  phrases  “for  all,”  “for  every,”  “for 
some,”  or  “there  exists,”  quantify  variables.  “For  all,”  or  V,  is  the  universal  quantifier 
and  “there  exists,”  or  3,  is  the  existential  quantifier. 

After  agreeing  that  the  universe  consists  of  all  real  numbers,  consider  the  follow¬ 
ing  statement:  “For  all  x  it  is  the  case  that  x2  -  1  <  0.”  We  know  that  we  are  asking 
that  for  every  x,  something  must  happen.  It  just  so  happens  that  this  statement  is 
false,  but  it  is  still  a  clear  statement.  For  all  x  is  usually  written  Vx.  So  we  could 
write 

Vx.x2  —  1  <  0. 

What  follows  the  words  “For  all  x”  in  our  statement  is  another  sentence  that  we 
could  denote  by  p,  but  since  p  is  a  sentence  involving  x,  we  write  p(x).  The  state¬ 
ment  above  is  of  the  form 

Vx,p(x). 

One  more  remark  about  the  example  above.  Suppose  the  universe  is  (still)  the 
real  numbers,  but  we  want  to  make  this  a  statement  about  positive  integers  only.  In 
that  case,  we  can  express  our  statement  symbolically  as  follows: 

Vx,  (x  e  Z+  — >  (x2  —  1  <  0)) . 

One  very  common  error  is  to  write  Vx,  (x  £  Z+  A  (x2  —  1  <  0))  rather  than  what  we 
have  written  above.  But  let’s  think  about  what  this  would  mean:  This  would  say  that 
“all  real  numbers  are  positive  integers  and  satisfy  the  inequality  x2  —  1  <  0.”  It  is 
probably  clear  now  that  this  is  not  what  the  original  statement  said. 

For  a  different  example,  suppose  that  our  universe  is  the  set  of  integers  and  con¬ 
sider  the  sentence,  “There  is  an  integer  x  such  that  x  =  0.”  This,  too,  is  a  statement, 
and  happens  to  be  true.  This  statement  can  be  expressed  symbolically  by 

3x,  (x  =  0) 

and  is  read  as  “there  exists  x  such  that  x  =  0.”  This  statement  is  of  the  form 

3x,p(x). 

One  more  remark  about  the  last  example.  If  we  had  chosen  the  set  of  the  real 
numbers  as  the  universe,  we  would  express  our  statement  symbolically  as 

3x,  (x  £  Z  Ax  =  0). 

Note  that  this  time  we  are  claiming  that  x  exists,  is  an  integer,  and  x  =  0.  To  make 
all  these  things  happen,  we  must  use  a  conjunction  rather  than  an  implication.  We 
have  included  tips  on  quantification  that  we  hope  will  be  a  helpful  guide  (see  Tips 
on  Quantification  on  page  45),  but  remember  to  analyze  your  sentences  carefully 
before  translating  them  into  symbols. 
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When  you  negate  a  statement,  you  must  be  100%  clear  on  what  your  universe  is. 
You  can  easily  see  why,  too:  if  you  negate  rgZ  and  Z  is  your  universe,  then  there 
are  no  x  left,  but  if  you  negate  x  G  Z  and  R  is  your  universe,  there  are  still  plenty 
of  x  left  to  worry  about.  So  make  sure  that  you  give  careful  consideration  to  your 
universe  before  beginning  a  problem. 

Let’s  return  to  the  introductory  example  in  this  chapter  and  apply  what  we  have 
learned. 

Exercise  4.2.  Our  sentence  was  “The  equation  x2  +  2x  =  15  has  a  unique  solu¬ 
tion.” 

(a)  Write  this  statement  in  formal  language. 

(b)  Describe  a  universe  in  which  this  statement  is  true.  Explain  briefly  why  your 
answer  is  correct. 

(c)  Describe  a  universe  in  which  this  statement  is  false.  Explain  briefly  why  your 

answer  is  correct.  O 

Before  proceeding  to  negations,  a  little  more  practice  with  quantifiers  might  be 
helpful. 

Exercise  4.3.  Write  the  statements  below  in  symbols,  assuming  that  the  universe  is 
ffi.  throughout.  Make  sure  that  you  quantify  x;  is  it  “all  x”  or  “some  x”? 

(a)  For  all  x,  it  is  the  case  that  x  is  an  integer. 

(b)  There  exists  an  integer  x  such  that  x  >  0. 

(c)  There  is  a  rational  number  x  such  that  x2  +  1  =  0. 

(d)  For  every  real  number  x,  there  exists  a  real  number  y  such  that  x  <y. 

(e)  There  is  a  real  number  y  such  that  x  <  y  for  all  x. 

(f)  If  x  is  a  rational  number,  then  x2  —  n  ^  0. 

(g)  A  real  number  x  satisfies  x2  >  0,  if  x  ^  0. 

(h)  If  x  >  0,  then  x  >  4  or  x  <  6.  O 


We  negated  conjunctions,  disjunctions,  and  implications.  Now  we  will  think 
about  the  negation  of  a  quantified  statement. 

Suppose  we  have  the  statement  “Every  cow  is  black.”  How  would  we  negate  it? 
One  pretty  useless  way  is  to  say  “Not  every  cow  is  black.”  It’s  better  to  say  “Some 
cow  is  not  black.”  So  a  useful  negation  of 

Vx,p(x) 


is 


3  \x,~>p(x). 


Similarly,  if  we  say  “There  exists  a  black  cow,”  a  useful  negation  is  “No  cow  is 
black.”  So  a  negation  of 


3x,p(x) 
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is 

Vx,->p(x). 

You  will  find  that  sometimes  you  can  negate  a  sentence  directly  and  other  times 
you  need  to  convert  to  symbols.  Here  is  another  example. 

Example  4.4.  Negate  the  sentence  “People  who  live  in  glass  houses  do  not  throw 
stones.” 

We  will  assume  that  the  universe  is  the  set  of  all  people.  What  does  this  say? 
First,  it  says  something  about  all  people  who  live  in  glass  houses.  So  we  will  use 
the  quantifier  “for  all”  and  x  will  denote  a  person.  The  notation  g(x)  will  mean  that 
x  lives  in  a  glass  house.  The  notation  t(x)  will  mean  that  x  throws  stones.  So  our 
sentence  becomes  “For  all  x,  if  g(x),  then  --t(x).”  If  you  can  negate  it  now,  go  ahead. 
If  not,  go  through  the  steps  below.  You  should  provide  reasons  why  each  step  below 
is  correct: 

1.  -<(Vx,(g(x)  ->-<t(x))); 

2-  3 x,->(g(x) ->->f(x)); 

3.  3x,-i(->g(x)  V ->*(*)); 

4.  3x,(g(x)  Af(x)). 

The  last  sentence  says  that  the  negation  of  “People  who  live  in  glass  houses  do 
not  throw  stones”  is  “There  exists  a  person  who  lives  in  a  glass  house  and  throws 
stones.”  There’s  another  important  thing  to  notice  here.  Though  there  is  no  obvious 
quantifier  in  the  sentence  “People  who  live  in  glass  houses  do  not  throw  stones,” 
we  all  interpret  the  quantifier  as  a  universal  quantifier.  If  you,  or  someone  else,  do 
not  explicitly  include  a  quantifier,  (all!)  people  will  assume  you  meant  to  insert  a 
universal  quantifier.  O 

We  emphasize  that  while  it  is  good  to  practice  these  symbolic  manipulations,  it  is 
also  important  to  understand  what  you  are  doing.  Sometimes  you  will  find  it  easier 
to  use  the  symbolic  notation  and  sometimes  you  won’t.  Make  sure  you  keep  in  mind 
what  the  sentence  says,  and  whether  or  not  your  answer  seems  reasonable.  Before 
you  go  off  on  your  own,  we’ll  do  a  fairly  complicated  example  together. 

Example  4.5.  Suppose  our  universe  is  the  set  of  real  numbers  and  we  wish  to  negate 
the  statement  “For  every  rational  number  x,  there  exists  an  integer  n  that  is  greater 
than  x.” 

So  let’s  try  it.  First  we  note  that  “For  every  rational  number  x”  means  that  we 
are  being  told  that  “if  x  is  a  rational  number”  something  will  happen.  What?  There 
will  exist  an  integer  bigger  than  x.  So  this  is  an  implication  of  the  form  “For  all 
x,  if  x  is  a  rational  number,  then  there  exists  an  n  such  that  n  is  an  integer  and 
n  >  x.”  Sometimes  it  is  easier  to  understand  a  statement  if  we  replace  the  various 
subsentences  with  symbolic  representations.  We  use 

p(x)  for  x  is  a  rational  number. 
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q(n)  for  n  is  an  integer,  and 
r(n,x)  for  n  >  x. 

Using  this  notation,  we  have 

Vx,  ( p(x )  — >  3n,  ( q(n )  A  r(n,x ))). 

Let’s  try  to  negate  this  quantified  statement  form  one  step  at  a  time,  starting  from 
the  outside. 

We  know  that  when  we  negate  “for  all”  it  becomes  “there  exists.”  In  other  words, 
we  can  replace  -i(Vx,  •  •  • )  with  3x,  ->(•  •  • ).  So  here’s  where  we  are  now: 

-■(Vx,  (p(x)  ->  3 n,  ( q(n )  A  r(w,x)))) 


is  equivalent  to 

3 x,-i(p(x)  — >  3 n,  ( q(n )  A  r(n,x))). 

Now  we  negate  the  implication.  From  the  last  chapter  we  know  that  ->(/*  — >  Q ) 
is  equivalent  to  P  A  ~Q.  We’re  up  to 

3x,  (p(x)  A  ->(3 n,  ( q{n )  A  r(n,x)))). 

We  still  need  to  negate  Q,  which  is  the  expression  3n,  ( q(n )  A  r(n,x)).  At  least 
this  is  simpler  than  what  we  started  with!  Now  3  will  change  to  V  and  so  we  need 
only  worry  about  q(n)  A  r(n,x).  But  that’s  a  conjunction.  So  the  final  step  is  to  negate 
that,  and  we  know  the  negation  of  the  conjunction  will  become  ~^q(n)  V  —•r(n,x).  So 
here’s  where  we  are  now: 

3x,  (p(x)  A  (Vh,  (-i q(n)  V  — >r(w,x)))). 

We’ve  done  what  we  were  asked  to  do,  in  a  sense,  but  our  answer  is  still  in 
symbols.  Let’s  translate  back: 

“There  exists  an  x  such  that  x  is  a  rational  number  and  for  all  n  it  is  the  case  that 

n  is  not  an  integer  or  n  is  not  greater  than  x.” 

Well,  that’s  certainly  a  mouthful.  Let’s  try  again  (explain  how  we  get  the  following): 

“There  is  a  rational  number  x  such  that  for  all  n,  if  n  is  an  integer,  then  n  <  x.” 
And  finally  (explain!): 

“There  is  a  rational  number  x  such  that  for  all  integers  n,  n  <  x.”  O 

Not  all  negations  are  this  complicated,  but  even  in  simpler  statements  there  are 
things  of  which  you  should  be  wary.  Consider  the  two  statements  about  real  num¬ 
bers:  Vx,3y,x  +  y  =  0  and  3y,Vx,x  +  y  =  0.  Assuming  the  universe  is  the  set  of  real 
numbers,  what’s  the  difference  between  these  two  statements?  In  the  first,  we  say 
that  for  each  x  we  can  find  a  y  with  x  +  y  =  0.  That’s  a  statement  you  have  known 
to  be  true  for  years,  ever  since  you  learned  about  — x.  On  the  other  hand,  the  second 
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statement  says  that  there  exists  a  y  such  that  for  all  x,  we  have  x  +  y  =  0.  That  state¬ 
ment  is  false,  because  the  same  y  would  have  to  work  for  all  x.  What’s  the  moral  of 
this  story?  That  the  order  of  the  quantifiers  is  very  important. 

Exercise  4.6.  Negate  the  statements  (a)-(h)  of  Exercise  4.3.  O 

In  order  to  provide  you  with  lots  of  exercises,  we  will  discuss  the  contrapositive, 
converse,  and  inverse  of  a  statement  with  quantifiers,  such  as  Vx,  r(x)  or  of  3x,  r(x). 
We  use  the  following  rules: 

Given  a  statement  of  the  form  Vx,  (p(x)  — >  q(x)): 

the  converse  is  Vx,  (q(x)  — >  p(x)), 

the  contrapositive  is  Vx,  (-■  q(x)  — > ~^p(x)), 

and  the  inverse  is  Vx,  (~^p{x)  — >  ~^q(x)). 

Defined  this  way,  the  contrapositive  will  be  true  precisely  when  the  original  state¬ 
ment  is  true  and,  as  before,  this  will  not  be  true  of  the  converse  or  inverse.  For 
existential  quantifiers,  we  follow  the  same  procedure: 

Given  a  statement  of  the  form  3x,  (p(x)  — >  q(x)): 

the  converse  is  3x,  (q(x)  — >  p(x)), 

the  contrapositive  is  3x,  (^q(x)  — >  ~<p(x)), 

and  the  inverse  is  3x,  (^p(x)  — >  -<q(x)). 

Exercise  4.7.  For  each  of  the  following  state  the  contrapositive  of  the  statement,  the 
converse  of  the  statement,  the  negation  of  the  contrapositive  of  the  statement,  and 
the  negation  of  the  converse  of  the  statement. 

(a)  Vx,((p(x)  Aq(x))  ->  r(x)). 

(b)  (Assume  the  universe  for  this  is  the  real  numbers.)  If  there  is  a  real  number 

strictly  between  50  and  100,  then  that  number  is  an  integer  with  square  root 
less  than  8.  O 


Solutions  to  Exercises 

Solution  (4.1).  There  are  many  possible  answers.  We  list  some  below: 

(a)  {1,-1}; 

(b)  {1}; 

(c)  the  x-axis  in  R2; 

(d)  the  xy-plane  in  R3; 

(e)  {2n:n€Z}  =  {...,-2,0,2,...}: 

(f)  the  set  of  all  points  in  R2  such  that  both  the  x  and  y  coordinates  are  integers. 
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Solution  (4.2).  Parts  (b)  and  (c)  have  alternate  solutions. 

(a)  3x,  ((x2 +  2x  =  15)  AVy, (y2  +  2y  =  15  — >y  =  x)) 

(b)  We  choose  N  as  the  universe  for  both,  x  and  y.  The  statement  is  true  because 
3  is  the  only  natural  number  that  solves  the  equation,  which  we  leave  to  you 
to  check. 

(c)  Now  we  use  R  as  the  universe  for  both  variables.  The  statement  is  then  false 
because  3  and  —5  are  two  different  solutions. 

Some  people  use  the  symbol  3!  to  indicate  the  existence  of  a  unique  element  in  the 
universe.  With  that  notation,  part  (a)  reads:  3!x,x2  +  2x  =  15. 

Solution  (4.3).  Note  that  the  universe  was  assumed  to  be  R. 


(a) 

Vx,x  e  z. 

(b) 

3x,  ((x  €  Z)  A  (x 

>0)). 

(c) 

3x,  ((x  G  Q)  A  (x 

2  +  l  =0)). 

(d) 

j? 

LU 

A 

(e) 

LU 

A 

(f) 

Vx ,  (xeQ^x2 

-^7^0). 

(g) 

Vx ,  (-i(x  =  0)  — > 

x2  >  0). 

(h) 

T 

o 

A 

£ 

>  4)  V  (x  < 

Solution  (4.6).  Note  that  the  universe  was  assumed  to  be  R. 

(a)  There  exists  an  x  such  that  x  is  not  an  integer. 

(b)  For  all  x,  the  real  number  x  is  not  an  integer  or  x  is  nonpositive.  This  is  equiv¬ 
alent  to:  For  all  x,  if  x  is  an  integer,  then  x  is  nonpositive. 

(c)  For  all  x,  if  x  is  a  rational  number,  then  x2  +  1  ^  0. 

(d)  There  exists  an  x  such  that  for  all  y  we  have  x  >  y. 

(e)  For  all  y,  there  exists  an  x  such  that  x  >  y. 

(f)  There  is  a  rational  number  x  such  that  x2  —  K  =  0. 

(g)  For  some  x,  we  have  x  ^  0  and  x2  <  0. 

(h)  There  exists  a  positive  real  number  x  such  that  x  <  4  and  x  >  6. 


Solution  (4.7). 

(a) 


For  the  contrapositive:  Vx,  (-r(x)  — >  (~>p(x)  V  - 
For  the  converse:  Vx,  (r(x)  — >  (p(x)  Aq(x))). 

For  the  negation  of  the  contrapositive:  3x,  (-ir(x)  A  p(x)  A  q(x)). 

For  the  negation  of  the  converse:  3x,  (r(x)  A  (~'p(x)  V  ^g(x))). 

(b)  We  begin  by  writing  the  statement  in  formal  language.  The  universe  for  this 
problem  is  the  set  of  real  numbers:  3x,  (50  <  x  <  100  -+  (x  £  ZA  y/x  <  8)). 
Now  for  the  contrapositive:  3x,  ((x^ZV  yfx  >  8)  — >  (x  <  50Vx  >  100)).  In 
words:  There  is  a  real  number  such  that  if  it  is  not  an  integer  or  its  square  root 
is  at  least  8,  then  the  number  is  at  most  50  or  at  least  100. 

For  the  converse:  3x,  ((x£ZA  yfx.  <  8)  — >  50  <  x  <  100).  In  words:  There 
is  a  real  number  such  that  if  it  is  an  integer  and  its  square  root  is  less  than  8, 
then  it  is  strictly  between  50  and  100. 
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For  the  negation  of  the  contrapositive:  Vx,  ((x  ^  Z  V  yfx  >  8)  A  50  <  x  <  100). 
In  words:  Every  real  number  is  strictly  between  50  and  100  and  at  least  one 
of  the  following  occur:  x  is  not  an  integer  or  the  square  root  of  x  is  at  least  8. 
For  the  negation  of  the  converse:  Vx,  (x  £  Z  A  y/x  <8A(x<50Vx>  100)). 
In  words:  Every  real  number  is  an  integer,  its  square  root  is  less  than  8,  and  it 
is  not  strictly  between  50  and  100. 


Problems 


Tips  on  Quantification  on  page  45  summarizes  many  of  the  major  points  in  this 
chapter.  You  may  find  it  helpful  to  read  these  tips  before  working  the  problems  below. 

Problem  4.1.  Write  the  following  statements  symbolically. 

(a)  For  every  x,  there  is  a  y  such  that  x  =  2 y. 

(b)  For  every  y,  there  is  an  x  such  that  x  =  2y. 

(c)  For  every  x  and  for  every  y,  it  is  the  case  that  x  =  2 y. 

(d)  There  exists  an  x  such  that  for  some  y  the  equality  x  =  2 y  holds. 

(e)  There  exists  an  x  and  a  y  such  that  x  =  2y. 

Problem  4.2.  Which  of  the  statements  in  Problem  4. 1  are  true  if  the  universe  for 
both  x  and  y  is  the  set  of  the  real  numbers? 

Problem  4.3.  Which  of  the  statements  in  Problem  4. 1  are  true  if  the  universe  for  x 
is  the  set  of  the  real  numbers  and  the  universe  for  y  is  the  set  of  the  integers? 

Problem  4.4.  Negate  the  statements  in  Problem  4. 1 . 

Problem  4.5.  Negate  the  following  sentences.  If  you  don’t  know  how  to  negate  it, 
change  it  to  symbols  and  then  negate.  State  the  universe,  if  appropriate. 

(a)  For  all  x  £  R,  we  have  x2  >  0. 

(b)  Every  odd  integer  is  nonzero. 

(c)  If  I  am  hungry,  then  I  eat  chocolate. 

(d)  For  every  girl  there  is  a  boy  she  doesn’t  like. 

(e)  There  exists  x  such  that  g{x)  >  0. 

(f)  For  every  x  there  is  a  y  such  that  xy  1 . 

(g)  There  is  a  y  such  that  xy  =  0  for  every  x. 

(h)  If  x  ^  0,  then  there  exists  y  such  that  xy  =  1. 

(i)  If  x  >  0,  then  xy 2  >  0  for  all  v. 

(j)  For  all  e  >  0,  there  exists  8  >  0  such  that  if  x  is  a  real  number  with  |x—  1 1  <8, 
then  |x2  —  1 1  <  e. 

(k)  For  all  real  numbers  M,  there  exists  a  real  number  N  such  that  |/(n)  |  >  M  for 
all  n  >  N. 
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Problem  4.6.  What  are  the  sets,  A  and  B,  described  by  the  following  statements? 

(a)  Vx,  (x  €  A  <->  Bn,  (n  £  Z  Ax  =  2n)). 

(b)  Vx,  (x  £  <->  3n ,  (n  G.  Z  A  x  =  2n  +  1 ) ) . 

Problem  4.7.  Consider  the  statement  of  Exercise  4.2  on  page  36. 

(a)  Write  the  negation  of  that  statement  using  symbols. 

(b)  Write  the  negation  of  that  statement  as  an  English  sentence. 

Problem  4.8.  Consider  the  following  statement. 

For  all  positive  integers  x,  there  exists  a  real  number  y  such  that  for  all  real  num¬ 
bers  z,  we  have  y  =  zxovz=yx- 

(a)  Write  this  statement  using  symbols  and  appropriate  quantification.  Use  R.  for 
the  universe  of  all  variables. 

(b)  Once  you  have  written  this  statement  in  symbols,  negate  the  (symbolic)  state¬ 
ment  that  you  obtained. 

Problem  4.9.  Consider  the  following  statement: 

Vx,  ((x  €  ZA~i(3 y,  (y  G  ZAx=  7y)))  — >  (3z,  (z  €  ZAx  =  2z))). 


(a)  Negate  this  statement. 

(b)  Write  the  original  statement  as  an  English  sentence. 

(c)  Which  statement  is  true,  the  original  one  or  the  negation?  Explain  your  an¬ 
swer. 

Problem  4.10.  Write  each  of  the  statements  below  using  symbolic  notation.  In  this 
problem,  use  R  as  the  universe  for  all  variables  involved. 

(a)  There  is  an  integer  that  is  bigger  than  its  square. 

(b)  Every  rational  number  is  the  product  of  two  irrational  numbers.  (Note:  A  real 
number  x  is  irrational  if  x  ^  Q.) 

(c)  There  are  integers  m  and  n  such  that  for  each  rational  number  x,  we  have 
m  <  nx  or  n  <  mx. 

(d)  Every  rational  number  is  the  solution  of  an  equation  ax  +  b  =  0,  where  a  and 
b  are  integers. 

Problem  4.11.  Why  is  this  joke  supposed  to  be  funny?  A  physicist,  a  chemist,  and 
a  mathematician  are  traveling  through  Switzerland.  From  the  train  they  spot  a  cow 
grazing  in  the  field.  The  chemist  gazes  out  the  window  and  says,  “Ah,  all  the  cows  in 
Switzerland  are  brown.”  The  physicist  says,  “No,  no.  You  can’t  conclude  that.  You 
can  only  say  that  some  of  the  cows  in  Switzerland  are  brown.”  The  mathematician 
says,  “No,  no,  no.  All  you  can  say  is  that  there  is  a  cow  in  Switzerland  that  is  brown 
on  one  side.” 

Problem  4.12.  For  each  of  the  following,  state 


4  Set  Notation  and  Quantifiers 


43 


1.  the  negation  of  the  statement; 

2.  the  converse  of  the  statement; 

3.  the  negation  of  the  converse; 

4.  the  contrapositive  of  the  statement;  and 

5.  the  negation  of  the  contrapositive. 

State  the  universe,  if  appropriate,  and  quantify  anything  that  is  quantifiable. 

(a)  Madeleine  waters  the  rosebush  only  if  it  is  Tuesday. 

(b)  If  I  ski,  I  will  fall. 

(c)  Windows  break  if  you  throw  balls  through  them. 

(d)  If  I  negate  a  sentence,  then  I  always  do  it  wrong. 

(e)  I  will  come  only  if  you  invite  me. 

(f)  For  all  positive  real  numbers  x,  there  exists  an  integer  n  such  that  1  /n  <  x. 
(For  the  universe  on  x  and  n,  use  the  real  numbers.) 

(g)  If  x  is  a  nonzero  real  number,  then  x2  ^  0. 

(h)  If  x  is  a  nonzero  real  number,  then  there  exists  a  real  number  y  such  that 
x-y  =  1. 

(i)  If  x  and  y  are  even  integers,  then  x  +  y  is  an  even  integer. 

Problem  4.13.  Find  a  different  useful  description  of  each  of  the  following: 

(a)  {xel:r2  =  2}; 

(b)  {(x,y)  Si2  :x  =  y}- 

(c)  {x  G  N :  x  <  0}; 

(d)  {x  G  Z  :  x2  >  0}. 

Problem  4.14.  Write  each  of  the  following  in  set  notation. 

(a)  The  set  of  all  odd  integers. 

(b)  The  set  of  all  points  in  the  xv-plane  above  the  line  v  =  x. 

(c)  The  set  of  all  points  in  the  xy-plane  that  are  inside  the  circle  of  radius  one. 

(d)  The  set  of  all  irrational  numbers. 

Problem  4.15.  Assume  that  the  universe  for  the  variables  below  is  R.  Consider  the 
statement  form  3M,  (( M  €  Z)  A  Vx,  (x2  <  M )). 

(a)  Negate  this  statement. 

(b)  Which  is  true,  the  statement  or  its  negation? 

Problem  4.16.  Consider  the  statement  “All  odd  positive  integers  are  prime.” 

(a)  Write  this  statement  using  logic  symbols  and  standard  set  notation  only.  You 
may  use  &  for  the  set  of  all  prime  numbers.  Make  sure  you  include  the  proper 
quantifiers. 

(b)  Negate  the  statement. 

(c)  Prove  the  statement  of  part  (a)  or  (b),  whichever  is  correct. 

Problem  4.17.  Let  a,  b1  and  c  be  fixed  real  numbers,  and  consider  the  statement  “For 
all  real  numbers  x,  if  x  is  greater  than  a  or  equal  to  b,  then  x  does  not  equal  c.” 
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(a)  Write  this  sentence  in  symbols. 

(b)  Negate  the  symbolic  statement. 

(c)  Translate  the  statement  back  into  a  (coherent)  English  sentence. 

Problem  4.18.  For  an  integer  x  consider  the  statement:  “If  8  does  not  divide  x2  —  l, 
then  x  is  even.” 

(a)  State  an  appropriate  universe  for  x. 

(b)  Write  the  statement  in  symbols. 

(c)  Negate  the  statement. 

Problem  4.19.  Assume  that  the  universe  for  all  variables  below  is  the  set  of  the  real 
numbers,  R.  Consider  the  statement: 

Vx,  (3y,  (x3  =  y2)  V  Vz,  (z2  <  0  — >  x3  ^  z2)) . 

(a)  Negate  this  statement  (keeping  the  symbolic  notation). 

(b)  Which  statement  is  true,  the  original  or  the  negation?  Give  a  very  brief  argu¬ 
ment  for  your  choice. 

Problem  4.20.  Decide  whether  statement  (3)  is  true  if  statements  (1)  and  (2)  are 
both  true.  Give  reasons  for  your  answers. 

(a)  The  three  statements  are: 

(1)  Everyone  who  loves  Bill  loves  Sam. 

(2)  1  don’t  love  Sam. 

(3)  I  don’t  love  Bill. 

(b)  The  three  statements  are: 

(1)  If  Susie  goes  to  the  ball  in  the  red  dress,  I  will  stay  home. 

(2)  Susie  went  to  the  ball  in  the  green  dress. 

(3)  I  did  not  stay  home. 

(c)  The  three  statements  are: 

(1)  If  /  is  a  positive  real  number,  then  there  exists  a  real  number  m  such  that 

m  >  l. 

(2)  Every  real  number  m  is  less  than  t. 

(3)  The  real  number  t  is  not  positive. 

(d)  The  three  statements  are: 

(1)  Every  little  breeze  seems  to  whisper  Louise  or  my  name  is  Igor. 

(2)  My  name  is  Stewart. 

(3)  Every  little  breeze  seems  to  whisper  Louise. 

(e)  The  three  statements  are: 

(1)  There  is  a  house  on  every  street  such  that  if  that  house  is  blue,  the  one  next 
to  it  is  black. 
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(2)  There  is  no  blue  house  on  my  street. 

(3)  There  is  no  black  house  on  my  street. 

(f)  Let  x  and  y  be  real  numbers. 

(1)  Ifx>5,theny<  1/5. 

(2)  We  know  y  =  1 . 

(3)  Sox <5. 

(g)  Let  M  and  n  be  real  numbers. 

(1)  If  n  >  M,  then  n2  >  M2. 

(2)  We  know  n  <  M. 

(3)  So  n2  <  M2. 

(h)  Let  x,y,  and  z,  be  real  numbers. 

(1)  If  y  >  x  and  y  >  0,  then  y  >z. 

(2)  We  know  that  y<z. 

(3)  Then  y  <  x  or  y  <  0. 


Tips  on  Quantification 


•  Check  the  universe  for  each  of  the  variables.  Write  it  down,  if  it  is  not  self- 
evident. 

•  When  a  quantifier  on  a  variable  in  a  statement  is  a  universal  quantifier,  writers 
sometimes  omit  it.  For  example,  you  may  read  a  statement  about  real  numbers 
such  as  “If  x  is  negative,  then  x2  is  positive.”  Because  the  quantifier  is  not  ex¬ 
plicitly  stated,  we  assume  the  author  meant,  “For  every  real  number  x,  if  x  is 
negative,  then  x2  is  positive.”  If  you  want  to  say  that  there  exists  such  an  x,  you 
must  include  the  existential  quantifier. 

•  Suppose  a  statement  restricts  the  variable  x  to  a  proper  subset  A  of  the  universe  as 
in  the  statement  form,  “For  all  x  G  A,  property  p(x)  holds.”  Since  x  is  universally 
quantified,  this  is  an  implication  of  the  form 

Vx,  (x£A-»  p(x)). 

•  Suppose  a  statement  restricts  the  variable  x  to  a  proper  subset  A  of  the  universe 
as  in  the  statement  form,  “For  some  x  G  A,  property  p(x)  holds.”  Since  x  is  exis¬ 
tentially  quantified,  this  is  a  conjunction  of  the  form 

3x,  (x  €  A  A  p{x)). 

•  Simple  statements  are  usually  easy  to  negate.  Just  do  it. 
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•  Complicated  statements  will  often  resist  a  “just  do.”  Write  them  out  in  symbols 
first.  Make  sure  you  know  what  the  quantifier  is  on  every  variable.  Check  for  the 
various  ways  one  can  say  “if...,  then....” 

•  Do  not  use  logical  connectives  (->,  A,  V,— >,<->)  between  quantifiers.  (Do  not  write 

“VxWy  •  •  •”  or  “VxA  Vy-  •  •  .”) 

•  Know  the  rules.  You  must  know  how  to  negate  statements  involving  existential 
quantifiers,  universal  quantifiers,  conjunctions,  disjunctions,  and  implications. 
The  most  important  negation  is  also  the  one  students  frequently  forget:  the  nega¬ 
tion  of  an  implication. 

•  Practice:  Every  time  you  get  a  definition  or  theorem,  try  negating  it.  If  you  can’t, 
this  might  indicate  that  you  do  not  fully  understand  it. 

If  you  think  you  need  more  practice,  here  it  is.  In  what  follows,  unless  otherwise 
stated,  all  variables  are  real  numbers,  and  £  and  8  represent  positive  real  numbers. 
Negate  all  of  these. 

(a)  Let  a  be  a  fixed  element  of  R.  For  every  £  there  exists  8  such  that  for  every 
x  G  R,  if  |x  —  a\  <  8,  then  \x2  —  a2\  <  e. 

(b)  For  all  x,  we  have  x  <  x+  £  for  every  e. 

(c)  For  every  integer  «,  there  exists  x>  n  such  that  x2>«2. 

(d)  For  all  x.y,  and  z,  if  x  <  y  and  z  <  0,  then  zx  >  zy. 

(e)  For  every  £,  there  exists  an  integer  N  such  that  l/n  <  e  for  all  n>N. 

(f)  For  all  x,  we  have  x  <  0  or  x  >  0. 

(g)  For  all  x,  there  exists  an  integer  n  such  that  n  >  x. 

(h)  For  all  x  and  y,  if  x  <  y  +  e  for  all  £,  then  x  <  y. 

(i)  For  every  £,  there  exists  8  such  that  8  <  £. 

We  provide  solutions  (which  you  should  not  look  at  until  you  try  the  negation!) 
for  the  first  four  exercises  below. 

(a)  Let  a  be  a  fixed  element  of  R.  There  exists  £  such  that  for  every  8  there  exists 
x  for  which  |x  —  a\  <8  and  |x2  —a2 |  >  £. 

(b)  There  exist  x  and  £  such  thatx  >  x  +  £. 

(c)  There  exists  an  integer  n  such  that  for  all  x,  if  x  >  n,  then  x2  <  n2. 

(d)  There  exist  x,y,  and  z  such  that  x  <  y,  z  <  0,  and  zx  <  zy.  O 


Chapter  5 

Proof  Techniques 


The  fact  that  Adolphe  Sax  invented  the  saxophone  hardly  proves  that  William  Tell  invented 
the  telephone. — Markus  M.  Ronner  (Swiss  journalist)1 

In  this  chapter,  we  introduce  you  to  some  of  the  most  common  proof  techniques. 
The  three  methods  we  will  examine  in  this  section  are: 

•  direct  proof  (just  get  started  and  keep  going), 

•  proof  by  contradiction  (show  that  the  negation  of  the  statement  you  wish  to  prove 
implies  the  impossible),  and 

•  proof  in  cases  (which  may  be  used  when  conditions  dictate  that  different  situa¬ 
tions  occur). 

There  are  many  more.  For  example,  another  proof  technique  that  you  may  be  fa¬ 
miliar  with  from  the  study  of  calculus  is  the  method  of  exhaustion,  such  as  comput¬ 
ing  area  or  volume  calculations  by  “filling  up  the  object”  with  a  sequence  of  more 
familiar  smaller  sets.  Sometimes  these  techniques  are  used  in  combination.  Some 
other  methods,  such  as  proof  of  existence  and  uniqueness  of  an  object  or  proof  by 
induction,  will  appear  in  subsequent  chapters. 

The  first  example  is  a  direct  proof.  We  want  to  show  that  “If  A,  then  B  is  true.” 
So  we  do  it  in  our  most  direct  manner:  We  start  with  A  and  keep  going  until  we  get 
to  B.  Before  getting  started,  we  make  sure  we  know  the  meaning  of  every  word  in 
the  implication  and  we  try  to  make  sure  that  the  implication  is  true. 

Theorem  5.1.  Ifa,b,  and  c  are  integers  such  that  a  divides  b  and  a  divides  c,  then  a 
divides  b  +  c. 

“Understanding  the  problem.”  Okay,  before  we  get  started,  let’s  identify  the  hy¬ 
pothesis  and  conclusion.  What  are  they?  The  hypothesis  is  a.  b,  and  c  are  integers 
such  that  a  divides  b  and  a  divides  c.  We  get  to  start  with  that.  What  does  a  divides 
b  mean?  We  say  that  a  nonzero  integer  a  divides  b  if  there  is  an  integer  n  such  that 
b  =  an.  There  is  even  a  standard  symbol  for  this,  namely,  a\b.  Since  we  have  already 

1  The  translation  is  ours. 
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defined  everything  here,  we  understand  the  problem  and  we  feel  confident — raring 
to  go,  in  fact.  What’s  the  conclusion  we  need  to  come  to?  The  conclusion  is  a  divides 
b  +  c,  and  we  know  what  this  means  because  we  understand  “divides.” 

“Devising  a  plan.”  So  we  know  that,  in  the  notation  we  used  above,  b  =  am  and 
c  =  an  where  m  and  n  are  both  integers.  We  need  to  show  that  a  divides  b  +  c,  or 
that  there  is  an  integer  j  with  b  +  c  =  aj.  Looking  at  what  we  were  given  and  what 
the  desired  conclusion  is  should  suggest  the  plan. 

Proof.  Since  a.h.  and  c  are  integers  such  that  a\b  and  a\c,  we  know  that  there  exist 
integers  m  and  n  such  that  b  =  am  and  c  =  an.  Therefore,  h  +  c  -  am+an  =  a(m+n). 
Since  m  +  n  is  an  integer  and  a^f  0,  we  conclude  that  a\(b  +  c).  □ 

“Looking  back."  Let’s  admire  this  proof  for  a  minute.  It’s  so  lovely.  There  are 
complete  sentences,  periods,  and  all  symbols  are  carefully  defined.  We  say  where 
we  are  starting;  that  is,  what  the  assumption  is,  and  we  end  by  saying  what  the  con¬ 
clusion  is.  Just  in  case  the  reader  hasn’t  noticed,  though,  we  indicate  that  we  are 
done  by  adding  the  little  box,  □.  Other  people  use  Q.E.D.  ( quod  erat  demonstran¬ 
dum  which  is  Latin  for  which  was  to  be  demonstrated).  Your  proofs  should  be  just 
as  appealing  as  the  one  above. 

What  follows  is  an  example  of  a  proof  by  contradiction,  sometimes  referred  to 
as  reductio  ad  absurdum.  The  idea  of  such  a  proof  is  that  we  suppose  that  what  we 
wish  to  conclude  is  false  and  show  that  something  really  silly  happens  (hence  the 
absurdum).  Below  is  an  example  of  this  idea  that  goes  back  to  the  Pythagoreans. 
This  is  one  of  two  proofs  presented  by  G.  H.  Hardy  in  his  famous  book  A  Math¬ 
ematician’s  Apology  [45],  as  an  example  of  a  beautiful  proof.  (The  first  proof  in 
Hardy’s  text  is  in  the  problems.  If  you  haven’t  read  his  book,  it  is  another  one  that 
we  highly  recommend.) 

Theorem  5.2.  The  number  \/2  is  not  rational. 

“Understanding  the  problem."  Before  we  begin,  we  make  sure  that  we  know 
what  all  the  words  mean,  what  we  are  assuming,  and  what  we  are  trying  to  prove. 
A  rational  number  is  a  number  of  the  form  p/q  where  p  and  q  are  integers,  and  q  is 
nonzero.  So  we  need  to  show  that  s/2  is  not  of  this  form;  that  is,  there  are  no  integers 
p  and  q  (with  q  nonzero)  such  that  s/2  =  p/q.  That  may  seem  like  a  tall  order,  since 
it  seems  to  mean  we  have  to  look  through  all  possible  integers!  This  leads  directly 
to: 

“Devising  a  plan.”  Perhaps  it  would  be  easiest  to  assume  s/2  =  p/q  (with  p  and 
q  integers  and  q  f=0)  and  see  what,  if  anything,  happens.  This  is  precisely  the  idea 
behind  proof  by  contradiction. 

Proof.  Suppose,  to  the  contrary,  that  +2  is  rational.  Then  there  exist  integers  p  and 
q  (with  q  nonzero)  such  that  \/2  =  p/q.  We  may  assume  that  p  and  q  have  no 
common  factor,  for  if  they  did,  we  would  simplify  and  begin  again.  Now,  we  have 
that  s/2 q  =  p.  Squaring  both  sides,  we  obtain  2q2  =  p2.  Thus  p2  is  even.  Since  p2  is 
even,  we  know  from  Problem  3.2  that  p  must  be  even.  Therefore,  p  =  2m  for  some 
integer  m.  This  means  that  2 q2  =  4m2.  Dividing,  we  see  that  q2  =  2m2.  But  this 
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means  that  q2  is  even.  Again  we  know  from  Problem  3.2  that  q  is  even.  So  p  and 
q  have  a  common  factor  2,  which  is  completely  absurd,  since  we  assumed  they  had 
no  common  factor.  Therefore  our  assumption  that  \J2  is  rational  must  be  wrong  and 
we  have  completed  the  proof  of  the  theorem.  □ 

“Looking  back.”  Note  that  we  slipped  in  a  reference  to  Problem  3.2.  If  we  hadn’t, 
you  would  have  read  “Since  p2  is  even,  p  must  be  even.”  Your  reaction  to  this  could 
have  been  “Oh  yeah,  we  did  that  already.”  That’s  fine.  But  you  could  also  have 
stopped,  tried  to  think  about  why  it  is  true,  tried  to  prove  it,  and  so  on.  That’s  fine 
too,  in  some  sense,  but  you  don’t  want  to  re-prove  everything  we  have  already  done. 
So  if  the  writer  tells  the  reader  why  something  is  true,  it  saves  the  reader  valuable 
time.  Or,  you  could  also  have  skipped  right  over  it,  never  worrying  about  why  it  is 
true.  That’s  not  fine.  You  need  to  understand  each  sentence  in  a  proof! 

Knowing  how  to  split  a  proof  into  cases,  which  we  will  refer  to  as  a  “proof  in 
cases,”  is  something  that  will  be  extremely  useful  too.  Here  is  an  example  of  some¬ 
thing  defined  in  cases.  Once  we  understand  this  definition,  we’ll  prove  something 
using  it. 

For  a  real  number  x,  the  absolute  value  of  x  is  defined  in  cases  by 


fx  if  x 
\  —  x  if  x  <  0 


Is  this  what  you  were  expecting  the  definition  to  be?  If  not,  let’s  make  sure  it 
agrees  with  what  you  were  expecting.  If  x  =  3,  then  x  >  0,  and  we  conclude  that 
|3|  =  3.  If  x  =  —3,  then  x  <  0,  and  we  conclude  that  |  —  3|  =  —(—3)  =  3.  If  you  feel 
comfortable  with  this  definition,  you  are  ready  to  move  on  to  the  theorem.  If  not, 
work  out  a  few  more  examples  and  then  move  on. 

Theorem  5.3.  Let  x  and  y  be  real  numbers.  Then  |xy|  =  |x||y|. 

We  made  sure  that  we  understood  the  definition  of  absolute  value  before  pro¬ 
ceeding  to  the  theorem,  so  we  understand  the  problem.  Let’s  think  about  devising  a 
plan. 

“Devising  a  plan.”  Absolute  value  was  defined  in  cases,  and  therefore  |xyj  de¬ 
pends  on  whether  xy  >  0  or  xy  <  0.  The  first,  xy  >  0,  is  actually  two  cases  again: 
xy  >  0  or  xy  =  0.  What  are  the  possibilities?  Well,  xy  >  0  would  mean  that  both 
x  >  0  and  y  >  0,  or  both  x  <  0  and  y  <  0.  The  case  xy  =  0  would  mean  that  x  =  0 
or  y  =  0.  The  final  possibility,  xy  <  0,  would  mean  that  one  of  the  two,  x  or  y,  is 
negative  and  the  other  is  positive.  It  seems  that  we  have  four  cases  to  consider:  both 
x  and  y  positive,  both  negative,  at  least  one  of  the  numbers  is  zero,  and  one  of  the 
two  numbers  negative  while  the  other  is  positive. 

Proof.  First,  suppose  that  x  >  0  and  y  >  0.  Then  xy  >  0  and  we  have  |xy|  =  xy, 
|x|  =  x,  and  |y|  =  y.  Therefore, 


M  =  xy  =  |x||y|  , 


and  we  have  established  the  result  in  this  case. 
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Second,  suppose  that  x  <  0  and  y  <  0.  Then  xy  >  0  and  we  have  \xy\  =  xy, 
|jc|  =  —x,  and  |y|  =  —y.  Therefore, 

\xy\=  xy  =  {-x){-y)  =  \x\\y\  , 


and  we  have  the  result  for  this  case  as  well. 

Third,  suppose  that  x  =  0  or  y  =  0.  Then  xy  =  0  and  we  have  \xy\  =  0.  Also, 
|x|  =  0  or  |y|  =  0.  Therefore, 


M  =  0=  \x\\y\  , 

establishing  the  result  in  this  case  too. 

For  our  final  case,  suppose  that  one  number  is  positive  and  the  other  is  negative. 
Thus,  we  may  assume  that  x  <  0  and  y  >  0.  Then  xy  <0  and  we  have  \xy\  =  —  {xy), 
\x\  =  —x,  and  |y|  =  y.  Therefore, 

M  =  -{xy)  =  {~x)y=  \x\\y\ . 

We  have  now  established  the  result  for  all  four  possible  cases  and  we  may  conclude 
that  |xy|  =  \x\ |y|  for  all  real  numbers  x  and  y.  □ 

Once  again,  look  at  the  form  of  the  proof.  There  are  four  cases  and  we  tell  the 
reader  which  case  we  are  discussing  before  we  discuss  it.  We  can  conclude  some¬ 
thing  in  each  case,  but  it  isn’t  until  we  cover  all  four  possible  cases  that  we  can  write 
“we  may  conclude  that  \xy\  =  |x| |y|  for  all  real  numbers  x  and  v.” 

It  will  also  be  helpful  to  know  how  to  show  something  is  not  true.  A  statement 
whose  truth  is  anticipated,  but  for  which  we  have  no  proof  yet  is  called  a  conjecture. 
There  are  many  different  ways  that  one  might  arrive  at  a  conjecture.  It  can  be  due 
to  the  intuition  or  insight  of  a  great  mathematician,  or  it  can  be  a  generalization  of 
observations  gleaned  from  many  examples.  The  latter  has  become  more  common  in 
recent  years,  in  part  due  to  the  capabilities  of  powerful  calculators  and  computers. 
Once  we  find  a  proof,  the  conjecture  turns  into  a  theorem.  One  of  the  most  famous 
examples  in  recent  history  is  a  proof  by  Andrew  Wiles.  In  1995,  Wiles  turned  Fer¬ 
mat’s  last  conjecture  into  Fermat’s  last  theorem,  [111].  Fermat’s  last  theorem  was  a 
conjecture  for  over  350  years.  (Watch  the  excellent  Nova  episode  “The  Proof”  for 
the  full  story  on  the  history  of  Fermat’s  last  theorem,  [10].) 

Another  recent  and  major  achievement  was  the  proof  of  the  100-year-old  Poincare 
conjecture  by  Grigori  Perelman.  In  2002  and  2003,  Perelman  made  three  papers 
available  on  the  Internet  and,  by  2006,  experts  were  convinced  that  Perelman’s  pa¬ 
pers  provided  a  positive  answer  to  the  Poincare  conjecture.  Such  an  achievement 
was  well  worthy  of  the  Fields  Medal,  one  of  the  top  two  honors  a  mathematician  can 
receive  and  often  considered  the  equivalent  of  the  Nobel  Prize  (with  the  other  award 
being  the  Abel  Prize).  Perelman  declined  the  Fields  Medal.  Because  the  Poincare 
conjecture  is  one  of  the  seven  millennium  problems  for  which  the  Clay  Mathemat¬ 
ics  Institute  has  offered  a  one  million  dollar  award  for  the  solution,  Perelman  was 
offered  this  award.  As  expected,  he  declined  this  prize,  too.  There  is  a  biography 
of  Grigori  Perelman  and  the  culture  of  Russian  mathematicians  in  the  20th  century. 
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[33].  The  Clay  Institute  has  a  website  with  all  seven  problems  listed,  and  some  of 
them  are  formulated  as  conjectures.  (See  [19].) 

It’s  important  to  note  that  just  because  you  believe  something  might  be  true, 
doesn’t  mean  that  it  necessarily  is  true.  Sometimes  you  will  find  that  a  conjecture 
someone  else  has  made  (or  even  one  that  you  have  made)  is,  in  fact,  false.  In  these 
cases,  you  need  to  find  an  example  of  something  that  satisfies  the  hypotheses  of  your 
conjecture,  but  not  the  conclusion.  An  example  is  the  following  conjecture  of  Pierre 
de  Fermat — one  of  the  very  few  of  his  conjectures  that  turned  out  to  be  wrong. 

Consider  numbers  of  the  form  22  +1,  where  m  is  a  natural  number.  The  first 
number,  22  +  1  =  3,  is  prime.  The  second,  2 2  +  1  =  5,  is  also  prime,  as  are  the 
third,  fourth,  and  fifth  numbers.  In  fact,  Fermat  conjectured  that  if  m  is  a  nonnegative 
integer,  then  22  +  1  is  prime.  In  1732,  the  Swiss  mathematician  Leonhard  Euler 
showed  that  this  was  false  by  showing  that  the  sixth  number  in  this  list,  2+1 
:  4294967297,  can  be  factored.  In  fact,  our  calculator  tells  us  that 

225  +  l  =641 -6700417. 

Thus  Fermat’s  conjecture  is  false. 

An  example  that  shows  that  a  statement  is  false  is  called  a  counterexample.  You 
only  need  one  to  show  something  is  false! 

We  end  this  chapter  by  discussing  one  final  statement  form  that  appears  fre¬ 
quently  in  mathematics,  and  this  is  a  statement  of  the  form  “A  if  and  only  if  B.”  This 
is  a  really  convenient  way  of  saying  two  things:  A  if  B  and  B  if  A.  In  other  words, 
it’s  two  statements  in  one!  We  have  already  seen  examples  of  this;  for  example,  in 
Problem  3.2  (c)  you  summarized  a  result  as 

An  integer  a-  is  odd  if  and  only  if  a2  is  odd. 

We  proved  this  statement  in  two  installments.  First  we  showed  the  implication  of 
Theorem  3.3:  For  all  real  numbers  x,  if  x2  is  odd,  then  x  is  odd.  We  proved  the 
converse  in  Problem  3.2  (a):  For  all  real  numbers  x,  if  x  is  odd,  then  x2  is  odd. 
This  approach,  proving  the  necessity  and  sufficiency,  is  what  you  will  do  each  and 
every  time.  So  whenever  you  have  to  prove  an  “if  and  only  if”  statement  you  should 
rejoice:  You  get  to  give  two  proofs  for  one  problem.  To  make  your  proof  clear  and 
easy  to  follow,  it’s  a  good  idea  to  break  the  proof  into  two  parts,  indicate  clearly 
which  part  you  are  proving,  and  indicate  clearly  when  you  have  finished  proving 
each  part.  Here’s  an  example. 

Example  5.4.  Show  that  for  a  real  number  x,  we  get  —2  <  x  <  1  if  and  only  if 

(2x+  l)/(x—  1)  <  1. 

“Devising  a  plan.”  We  have  to  prove  an  equivalence,  so  we  have  two  separate  prob¬ 
lems,  each  of  which  is  an  implication.  We  will  first  show  that  for  all  real  numbers 
x,  if  -2  <  x  and  x  <  1,  then  (2x+  1 )  /  (xr  —  1)  <  1.  A  direct  proof  should  work  here. 
Then  we  will  prove  the  converse:  For  all  real  numbers  x,  if  (2x+  l)/(x—  1)  <  1, 
then  —  2  <  x  and  x  <  1.  We  will  find  it  convenient  to  multiply  the  inequality  by 
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x  —  1.  Since  we  don’t  know  whether  x  —  1  is  positive  or  negative,  we  will  need  to 
consider  two  cases.  Consequently,  we  expect  this  proof  to  be  a  proof  in  cases. 

Proof.  We  first  assume  that  2  <  x  and  x  <  1 .  Adding  x  +  1  to  the  first  inequality, 
—2  <  x,  we  get  x  —  1  <  2x+  1.  Subtracting  1  from  the  inequality  x  <  1  leads  to 
x  —  1  <  0.  Therefore,  when  we  divide  by  x  —  1,  this  “flips  the  sign.”  As  a  result  we 
obtain  (2x+  l)/(x—  1)  <  1,  as  required. 

For  the  converse  we  assume  that  (*)(2x+  l)/(x—  1)  <  1.  Note  first  that  this 
implies  that  x  f  1.  Now  our  proof  will  be  in  cases,  x  —  1  >  0  and  x  —  1  <  0.  So  first 
suppose  thatx—  1  >  0.  Multiplying  (*)  by  x-  1,  we  get  2x+  1  <  x—  1.  This,  and  our 
assumption  thatx—  1  >  0,  implies  0  <  x—  1  <  —3.  This  is  a  contradiction  and  shows 
that  this  case  is  impossible.  Now  consider  the  remaining  case,  namely,  x  —  1  <  0,  or 
equivalently,  x  <  1.  As  before,  this  implies  that  2x+  1  >  x  —  1.  This,  in  turn,  implies 
that  x  >  —2.  We  conclude  that  —2  <  x  <  1,  completing  the  proof  of  the  converse. 

Since  we  have  proved  both  the  statement  “—2  <  x  <  1  implies  (2x+  l)/(x  —  1) 
<  1,”  and  its  converse,  we  have  completed  the  proof  of  the  result.  □ 

“Looking  back.’’  Reviewing  the  structure  of  the  proof  above,  we  note  that  we  have 
proved  an  implication  and  its  converse.  We  use  certain  phrases  for  the  benefit  of  the 
reader;  for  example,  when  we  say  “We  will  first  show  that...,”  this  is  to  let  the  reader 
know  what’s  coming.  Of  course,  the  astute  reader  would  figure  it  out,  eventually, 
without  being  told.  But  most  people  like  to  be  told  where  they  are  going  before 
they  get  to  their  destination,  and  the  same  is  true  in  mathematics.  After  indicating 
clearly  what  our  hypothesis  is,  we  work  until  we  obtain  the  desired  conclusion  and 
we  say  when  we  have  reached  our  conclusion.  There  are,  sometimes,  “if  and  only 
if”  proofs  in  which  each  step  of  the  proof  is  reversible,  and  some  authors  will  work 
both  directions  simultaneously  to  save  time  and  space.  Even  in  this  case,  it  is  less 
confusing  for  the  reader  if  you  prove  one  direction  and,  after  carefully  checking  that 
all  steps  are  reversible,  say  something  on  the  order  of  “since  all  the  steps  above  are 
reversible,  the  converse  follows.”  O 

It’s  very  convenient,  when  taking  notes  or  writing  quickly,  to  follow  the  lead  of 
many  mathematicians  and  write  “iff"  for  “if  and  only  if.”  While  it  can  be  a  real  time 
saver,  we  don’t  recommend  it  in  formal  writing. 


Definitions 

Definition  5.1.  A  nonzero  integer  a  divides  an  integer  b  if  there  is  an  integer  n  such 
that  b  =  an.  We  write  this  as  a\b. 

Definition  5.2.  For  a  real  number  x,  the  absolute  value  of  x  is  defined  to  be 
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Problems 

Problem  5.1.  Below  is  the  other  proof  Hardy  chose  to  present  ([45,  pp.  92-94]). 
This  theorem  and  its  proof  were  known  to  Euclid,  and  appear  in  the  Elements  IX 
20,  [47].  Can  you  read  and  understand  this  proof?  Read  the  whole  thing.  Underline 
anything  you  don’t  understand  the  first  time.  Reread  it  slower  this  time.  Underline 
anything  you  can’t  figure  out.  You  may  need  to  spend  10  minutes  on  each  sentence; 
you  may  not.  Then  write  the  general  idea  of  the  proof  in  “street  talk.”  A  bright, 
interested  twelve-year-old  should  be  able  to  follow  your  outline  of  the  proof. 

Before  you  begin,  make  sure  you  understand  what  will  be  assumed  and  what  we 
will  try  to  do.  Make  sure  you  know  what  all  the  words  mean.  “Infinite”  has  not  yet 
been  defined;  prime  number  has. 

Theorem  5.5.  There  are  infinitely  many  prime  numbers. 

Proof.  To  prove  this  statement,  suppose,  to  the  contrary,  that  there  are  finitely  many 
primes.  Then  we  may  write  these  finitely  many  primes  in  ascending  order  as 

2.3.5. . ..,N , 

where  N  is  the  largest  prime.  Now  consider  the  number  M  defined  by 

M  =  (2-3-5 . N)  +  1  . 

If  M  is  prime,  then  M  is  a  prime  that  is  larger  than  the  largest  prime  N.  Therefore, 
we  must  conclude  that  M  is  not  prime,  and  so  it  is  divisible  by  some  prime  number, 
P.  However,  P  must  appear  in  the  list  of  primes 

2.3.5..  ..,1V, 

which  we  gave  earlier.  But  when  we  divide  M  by  P,  we  obtain  a  remainder  of  1 . 
Therefore,  P  cannot  be  a  factor  of  M,  and  we  have  contradicted  our  assumption  that 
there  are  finitely  many  primes.  Thus,  there  exist  infinitely  many  primes.  □ 

Problem  5.2.  Prove  that  if  n  is  an  integer,  then  An2  +4n  +  8  is  an  even  integer.  What 
kind  of  proof  did  you  use? 

Problem  5.3.  Prove  that  if  n  is  an  integer,  then  i  f  +  3n  +  2  is  an  even  integer.  What 
method  of  proof  did  you  use? 

Problem  5.4.  In  this  problem,  we  outline  a  proof  of  the  following  theorem: 

Theorem  5.6.  Let  x  and y  be  real  numbers.  Ifxy  >  1  /2,  then  x2  +y2  >  1. 

Your  mission  is  to  fill  in  the  gaps  and  blanks,  leaving  no  detail  omitted. 

Proof.  The  proof  will  proceed  by  ( insert  name  of  proof  technique  or  description  of 
proof  strategy  here).  So  suppose  that  x2  +y2  <  1.  Now  we  know  that  (x  —  y)2  >  0. 
f Insert  missing  steps  of  proof  here.)  Therefore  xy  <  1  /2,  and  the  proof  is  complete. 

□ 
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Problem  5.5.  In  this  problem,  we  outline  a  proof  of  the  following  theorem: 

Theorem  5.7.  Ifn  and  m  are  nonzero  integers,  then  n2  —  m 2  ^  1 

Your  mission  is  to  fill  in  the  gaps  and  blanks,  leaving  no  detail  omitted. 

Proof.  The  proof  will  proceed  by  ( insert  name  of  proof  technique  or  description  of 
proof  strategy  here).  So  suppose  that  n2  —  m2  =  1.  Then  (n  —  m)(n  +  m)  =  1.  Since 
( insert  relevant  reason  here),  we  conclude  that  both  factors,  ( n  —  m)  and  ( n+m ),  are 
equal.  Now  we  cannot  have  n  —  m  =  n  +  m,  because  ( insert  reason  here).  Therefore 
(insert  concluding  sentence).  □ 

Problem  5.6.  Consider  the  two  statements  about  real  numbers  x  and  y. 

Statement  1 .  If  the  product,  xy,  is  not  a  rational  number,  then  x  or  y  must  be  an 
irrational  number. 

Statement  2.  If  x  is  a  rational  number  and  y  is  an  irrational  number,  then  x  +y  is 
irrational. 


(a)  What  method(s)  of  proof  would  you  use  to  prove  Statement  1?  Statement  2? 
Why? 

(b)  Prove  Statement  1  in  the  most  efficient  way  possible.  Do  you  need  to  modify 
your  answer  to  the  previous  part  of  the  problem?  Explain. 

(c)  Prove  Statement  2  in  the  most  efficient  way  possible.  Do  you  need  to  modify 
your  answer  to  the  previous  part  of  the  problem?  Explain. 

Your  first  proof  of  these  statements  may  not  be  the  best  one.  Run  through  the  various 
techniques  until  you  come  up  with  the  nicest  proof  of  each  statement! 

Problem  5.7.  Provide  counterexamples  to  each  of  the  following. 

(a)  Every  odd  number  is  prime. 

(b)  Every  prime  number  is  odd. 

(c)  For  every  real  number  x,  we  have  x2  >  0. 

(d)  For  every  real  number  x  ^  0,  we  have  1  jx  >  0. 

(e)  Every  function  / :  M  — >  M  is  linear  (of  the  form  mx  +  b). 

Problem  5.8.  Define  two  sets,  A  and  B,  by 

A  =  {x  €  Z  :  x  =  2n  for  some  n  £  Z}  and 
B  =  {x  £  Z  :  x  =  2 m  +  1  for  some  m  £  Z}. 


(a)  Using  these  definitions,  give  a  rigorous  proof  that  A  and  B  have  no  element  in 
common.  Make  sure  you  write  out  all  details. 

(b)  What  type  of  proof  did  you  use  in  part  (a)? 
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Problem  5.9.  Let  n  be  an  integer.  Prove  that  if  ir  is  divisible  by  3,  then  n  is  divisible 
by  3. 

Problem  5.10.  Show  that  \/3  is  not  rational.  (You  may  want  to  use  the  result  of 
Problem  5.9  to  work  this  problem.) 

Problem  5.11.  Prove  that  V2  +  \/3  is  not  a  rational  number. 

Problem  5.12.  Prove  that  the  square  of  an  integer  cannot  be  of  the  form  3k  +  2, 
where  k  is  an  integer. 

Problem  5.13.  Prove  that  sin2x  <  |  sinx  for  all  r£l. 

Problem*  5.14.  Let  x  be  a  real  number. 

(a)  Prove  that  —  |x|  <  x  <  |x|. 

(b)  Let  a  >  0.  Prove  that  |x|  <  a  if  and  only  if  —a  <x<a. 

(c)  Use  parts  (a)  and  (b)  to  prove  the  theorem  below. 

Theorem  5.8  (The  triangle  inequality).  Let  x  and  y  be  real  numbers.  Then 

|*+y|<|x|  +  |y|  ■ 

Problem  5.15.  Prove  the  lower  triangle  inequality:  Let  x  and  y  be  real  numbers. 
Then 

IH-|y||<|x-y|. 

There  are  many  ways  to  do  this,  but  we  would  like  to  firmly  suggest  using  the 
triangle  inequality  (see  Theorem  5.8  above). 

Problem  5.16.  Prove  that  for  real  numbers  z  and  w, 

|(l  +  z)(l  +  w)-l|  <  (l  +  |z|)(l  +  M)-l  ■ 

Problem  5.17.  Prove  or  refute  the  following  conjecture.  There  are  no  positive  inte¬ 
gers  x  and  y  such  that  x2  y2  =10. 

Problem  5.18.  Prove  or  refute  the  following  conjecture.  There  are  no  positive  inte¬ 
gers  x  and  y  such  that  x2  —  3xy  +  2 y2  =10. 

Problem  5.19.  Prove  that  for  all  real  numbers  x,  we  have  x  <  — 5  if  and  only  if 

1  <  (2x+  3) /(x  —  2)  <2. 

Problem  5.20.  Find  all  points  in  the  xy- plane  that  lie  on  the  surface 

4  =  5(x  —  3)2  +  3(y  —  7r)2  +  2(z+2)2. 

Write  up  your  solution  carefully.  What  method  of  proof  did  you  use? 

Problem  5.21.  Let  n  be  an  integer.  Prove  that  if  ir  —  (n  —  2)2  is  not  divisible  by  8, 
then  n  is  even. 
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Problem  5.22.  Let  n  £  Z+,  ao, . . .  ,a„  £  R,  and  an  ^  0.  Prove  that  the  polynomial 

p(x )  =  anxn+an^ \xn~x  H - l-flo  can  have  at  most  n  different  roots.  (Some  remarks 

are  in  order  here.  To  work  this  problem,  you  must  understand  it.  Recall  that  c  £  R 
is  a  root  of  a  polynomial  p  if  p(c)  =  0.  In  order  to  solve  the  problem,  you  also  need 
to  recall  that  if  c  is  a  root  of  p,  then  x  —  c  is  a  factor  of  p.) 

Problem  5.23.  Let  a i, . . .  ,flio  be  real  numbers  and  y  =  (x  —  a i)  •  •  •  (x  —  aio)  be  the 
equation  of  a  curve  in  the  plane  R2.  Prove  that  this  curve  has  a  horizontal  tangent 
at  x  =  a\  if  and  only  if  a \  =  a/  for  some  j  in  {2,3, ... ,  10}.  (Note:  This  problem 
requires  knowledge  of  calculus.) 

Problem  5.24.  Consider  the  following  statement. 

Vx,  (x  £  Z+  — >  By,  B z, 

(0  £  Q)  A  (z  £  Q)  A  (yz  ±  0)  A  (x2  =  y1  +  z2))). 


(a)  Change  this  symbolic  statement  to  an  English  sentence. 

(b)  Prove  the  statement  you  found  in  (a). 


Tips  on  Definitions 


“When  I  use  a  word,”  Humpty  Dumpty  said,  in  a  rather  scornful  tone,  “it  means  just  what  I 
choose  it  to  mean — neither  more  nor  less.” — Lewis  Carroll,  [  17,  p.  163] 

In  your  previous  courses,  you  may  or  may  not  have  had  to  memorize  definitions. 

Now  it  becomes  essential  that  you  memorize  them,  understand  them,  and  investigate 

them  before  venturing  on  to  use  them.  Here  are  some  suggestions  on  how  to  do  these 

things. 

•  The  first  step  is  to  make  sure  you  know  the  definition.  This  does  not  mean  that 
you  highlight  it  with  a  marker  and  read  it  over  a  few  times.  It  means  that  you,  first 
of  all,  understand  it,  and,  second  of  all,  memorize  it.  You  must  know  whether  the 
quantifiers  are  “for  all”  or  “there  exist,”  you  must  know  what  order  they  come  in, 
you  must  watch  the  order  on  implications,  and  you  must  be  sure  that  what  you 
write  is  correct.  Every  single  itty  bitty  detail  must  be  correct  or  chances  are  that 
your  definition  is  wrong. 

•  It’s  very  difficult  to  memorize  something  you  don’t  understand.  So  once  you  see 
a  definition  (in  bold  black  print  in  this  book)  write  it  down  and  think  about  what 
it  means. 

•  Give  many  examples,  until  you  feel  that  you  know  what  an  example  looks  like. 

•  Negate  the  definition  and  try  to  find  nonexamples  (that  show  when  things  won’t 
satisfy  the  definition). 
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•  Go  back  and  see  if  you  can  write  out  the  definition  without  looking  at  it.  Wait  a 
few  hours  and  do  that  again.  If  anything  is  out  of  place,  ask  yourself  if  it  matters. 
If  it  does,  repeat  the  appropriate  steps  here. 

•  Definitions  are  often  stated  as  implications.  This  leads  students  to  ask  if  the  defi¬ 
nition  is  an  equivalence.  The  answer  is  “yes.”  Consider  the  following  definition: 
“An  integer  m  is  even  if  there  exists  an  integer  n  such  that  m  =  2 n.”  Since  this 
is  how  we  defined  “even,”  we  also  mean  that  “if  m  is  even,  then  there  exists  an 
integer  n  such  that  m  =  2n .” 

•  Note  that  you  either  have  defined  something  correctly  or  not;  you  can’t  “get 
close”  to  a  definition  unless  you  get  it  right.  For  example,  suppose  you  are  asked 
to  define  cat.  Let’s  say  your  definition  is  “A  carnivorous  mammal,  domesticated 
since  early  times.”  Fair  enough,  it  seems.  Now  suppose  that  working  with  this 
definition  you  try  to  purchase  a  mail-order  cat  and  you  receive  a  large  St.  Bernard. 
While  this  might  upset  you,  it  shouldn’t  surprise  you.  Your  definition  was  close 
enough  that  you  didn’t  receive  a  mail-order  minivan,  for  example,  but  it  was  also 
wrong.  In  the  same  way,  mathematicians  will  be  upset  if  your  definition  includes 
things  it  shouldn’t.  Be  very  careful. 

Some  teachers  and  students  find  it  helpful  to  make  definition  notebooks.  In  such 
a  notebook,  you  will  do  all  the  steps  above  as  often  as  necessary.  We  heartily  rec¬ 
ommend  such  an  approach. 

In  this  text,  we  summarize  the  definitions  at  the  end  of  each  chapter.  This  is 
intended  to  help  you  with  a  definition  notebook;  it  is  not  meant  to  replace  the  note¬ 
book.  Working  the  following  additional  problems  will  help  you  appreciate  the  value 
of  a  good  definition. 

Problem  5.25.  Consider  an  object  that  all  of  you  know  well:  A  car. 

(a)  Define  a  car.  (You  may  not  say,  “A  car  is  an  automobile.”  Why  are  we  ruling 
this  out?) 

(b)  Give  an  example. 

(c)  Give  a  few  nonexamples.  Your  nonexample  should  be  close  to  the  definition 
of  “car”  but  not  close  enough  to  be  correct. 

Problem  5.26.  Definition.  We  will  call  a  natural  number  an  s-difference  if  it  is  the 
difference  of  the  squares  of  two  natural  numbers. 

(a)  Give  three  different  examples  of  s-differences. 

(b)  Write  the  definition  symbolically.  What  should  the  universe  be? 

(c)  Give  two  nonexamples  from  the  universe  that  you  chose  in  (b). 

Problem  5.27.  See  Problem  5.26  for  the  definition  of  an  s-difference. 

Definition.  We  will  call  an  s-difference  simply  even ,  if  it  is  even  but  not  a  multiple 
of  4. 

(a)  Give  an  example  of  a  simply  even  s-difference. 

(b)  Give  two  nonexamples. 
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(c)  Is  this  a  useful  definition?  Why  or  why  not? 

Problem  5.28.  At  the  beginning  of  Chapter  4  we  noted  that  set  is  an  undefined  ex¬ 
pression.  Some  texts  define  set  to  be  “a  collection  of  objects.”  Why  is  this  “defini¬ 
tion”  not  satisfactory? 

Problem  5.29.  For  the  first  three  parts  of  this  problem,  you  will  need  to  use  your 
knowledge  of  calculus. 

(a)  Consider  the  following:  A  function  /  :  K.  — >  R.  is  differentiable  if  it  is  contin¬ 
uous.  Is  this  a  definition  of  differentiable'?  Why  or  why  not? 

(b)  Consider  the  following:  A  sequence  of  real  numbers  is  a  list  of  real  numbers. 
Is  this  a  definition  of  a  sequence  of  real  numbers ?  Why  or  why  not? 

(c)  We  found  the  following  “definition”  of  tangent  line  online:  “A  line  that 
touches  a  curve  at  a  point  without  crossing  over.”  What’s  wrong  with  this 
definition?  (Say  as  much  as  you  can!) 

(d)  Is  this  a  definition  of  perfect  square  ?  A  real  number  is  a  perfect  square  if  there 
exists  n  such  that  x  =  n2. 


Chapter  6 

Sets 


In  Chapter  4  we  introduced  the  terms  set  and  element  of  a  set.  In  order  to  have  some 
flexibility  and  to  avoid  the  slightly  awkward  phrase  set  of  sets,  we  will  use  the  word 
collection  as  a  synonym  for  set.  In  particular,  we  will  usually  speak  of  a  collection  of 
sets,  which  is  just  a  set  of  sets.  Just  as  you  worked  with  points  and  lines  in  geometry 
without  having  a  rigorous  definition  of  those  terms,  we  will  ask  you  to  use  your 
intuition  as  you  work  with  the  terms  set  and  element.  It’s  important  to  note  that  you 
will  need  to  exercise  care  when  you  use  these  words.  Mathematics  describes  very 
carefully  and  exactly  what  you  can  do  with  sets  and  when  you  can  use  the  words 
element  of  a  set.  In  particular,  the  construction  of  “the  set  of  all  sets”  is  forbidden, 
as  this  would  lead  to  contradictions.  In  order  to  get  around  such  contradictions, 
mathematicians  have  developed  axioms.  These  are  listed  in  the  Appendix  as  the 
Zermelo-Fraenkel  system  together  with  the  axiom  of  choice  (ZFC,  for  short).  At  this 
point,  we  will  introduce  our  subject  in  a  less  formal  way,  leaving  a  more  axiomatic 
treatment  for  a  later  course  in  set  theory. 

The  objects  that  make  up  a  set  are  called  the  elements  or  members  of  the  set.  A 
set  has  a  defining  property,  and  it  is  used  to  determine  whether  or  not  an  element 
belongs  to  the  set:  To  decide  whether  or  not  x  is  in  the  set  S,  you  need  to  see  whether 
x  satisfies  this  defining  property  p.  The  empty  set  is  the  set  with  no  elements,  and 
is  denoted  by  0. 

Once  we  have  the  defining  property,  there  are  often  several  ways  to  describe 
a  set.  If  there  aren’t  too  many  elements  in  the  set,  then  we  can  list  all  elements: 
B  =  {Benny,  Betty,  Billy, Bobby}.  If  the  elements  come  from  a  well-known  larger  set 
X  and  satisfy  a  defining  property  p,  we  may  write  {x  G  X  :  p(x) }.  This  is  read  “the 
set  of  all  elements  of  X  satisfying  property  p.”  We  may  think  of  X  as  the  universe  in 
this  context. 

Example  6.1.  We  will  frequently  use  sets  such  as  the  set  of  all  even  integers,  the 
set  of  all  odd  integers,  or  the  set  of  integers  that  are  divisible  by  three.  We  will 
denote  these  sets  by  2Z,  2Z+  1,  or  3Z,  respectively.  Define  these  sets  using  the 
format  suggested  above;  that  is,  by  choosing  appropriate  X  and  p  so  that  each  set  is 
described  by  {x  €  X  :  /?(x)}. 
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For  the  set  of  even  integers,  we  write  2Z  =  (x  £  Z  :  x  =  2n  for  some  n  £  Z}.  The 
set  of  odd  integers  can  be  written  as  2Z  +  1  =  {x  £  Z  :  x  =  2n  +  1  for  some  n  £  Z}. 
Finally,  the  set  of  all  integers  that  are  divisible  by  three  is  3Z  =  {x  £  Z  :  x  =  3n  for 
some  n  £  Z}.  O 

Note  that  a  set  is  described  by  its  elements — not  by  the  order  we  put  the  elements 
in  the  set,  or  whether  we  put  an  element  in  more  than  once.  Thus  the  set  { 1 , 2, 3}  is 
the  same  as  the  set  {1, 1,3,2}. 

Exercise  6.2.  For  each  of  the  following  sets,  say  what  the  universe  is  and  write  out 
the  defining  property.  For  example,  if  we  wish  to  describe  the  set  of  all  women,  the 
universe  might  be  all  people,  and  the  defining  property  would  be  “x  is  a  woman.” 
Use  complete  sentences. 

(a)  The  collection  A  of  all  members  of  the  school  band. 

(b)  The  collection  B  of  all  irrational  numbers. 

(c)  The  collection  of  all  prime  numbers  greater  than  or  equal  to  4  and  less  than 

7-  O 

Exercise  6.3.  Care  needs  to  be  used  when  creating  a  defining  property.  What  is 
wrong  with  each  of  the  following? 

(a)  The  collection  C  of  all  pretty  people  in  Luxembourg. 

(b)  The  collection  D  of  all  collections  that  do  not  contain  themselves  as  an  ele¬ 
ment.  O 

The  notation  we  have  described  so  far  in  this  chapter  is  not  the  only  acceptable 
notation.  For  example,  if  we  know  what  our  universe  is,  there  may  be  no  reason  to 
repeat  it  in  the  notation.  Therefore,  we  may  write  {x  £X  :  p(x) },  or  we  may  simply 
write  {x  :  p(x) }.  The  next  exercise  introduces  you  to  a  slightly  different  way  of 
describing  a  set. 

Exercise  6.4.  Let  5  =  {x  £  Z  :  x  =  2n  +  1  for  some  n  £  Z}  and  T  =  {s2  :  s  £  S'}. 
The  notation  for  T  is  different  from  the  notation  we  have  discussed  thus  far  in  the 
chapter,  yet  you  can  still  determine  T.  Write  out  a  description  of  T  using  the  same 
notation  as  the  one  used  for  S.  Then  write  out  a  description  of  S  using  the  same 
notation  as  the  one  used  for  T .  Q 


Exercise  6.5.  Consider  the  set  A  of  nonzero  integers. 

(a)  Write  this  set  using  the  notation  A  =  {x  £  S  :  p{x)}. 

Use  what  you  learned  in  previous  chapters  to  answer  the  following  questions. 

Define  a  new  “multiplication”  on  A  by  x*y  =  2 xy  for  x,y  £  A.  For  parts  (b)  and 

(c)  below,  prove  the  statement  or  give  a  counterexample  to  it.  (If  you  find  you  cannot 
answer  the  questions  below,  read  the  discussion  following  part  (c).) 
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Fig.  6.1  ACB 

(b)  If  x, y  £  A,  then  x*y  £  A. 

(c)  There  exists  an  element  y  £A  such  that  x+y  =  x  for  every  x  £  A. 

If  you  really  can’t  get  started,  then  you  probably  didn’t  understand  the  problem. 
One  way  to  begin  is  to  pick  numbers  for  x  and  y  and  try  them  out  until  you  get  a  feel 
for  this  new  multiplication.  Once  you  understand  it,  try  rewriting  the  statements  so 
that  they  make  sense  to  you.  For  example,  in  (b),  replace  the  conclusion  x*y  by  its 
definition  to  obtain  “If  x,y  £  A,  then  2xy  £  A.”  All  this  should  help.  Remember,  the 
most  important  thing  is  to  get  started.  Q 

A  set  A  is  a  subset  of  a  set  B  or,  equivalently,  A  is  contained  in  B,  if  every 
element  of  A  is  an  element  of  B.  We  will  write  ACBto  indicate  that  A  is  a  subset 
of  B.  This  is  depicted  in  Figure  6.1. 

Notice  that  A  is  always  a  subset  of  itself:  A  C  A.  However,  a  subset  can  also  be 
truly  smaller,  and  we  often  find  it  necessary  to  use  our  notation  to  emphasize  this. 
We  say  that  A  is  a  proper  subset  of  B  if  A  C  B  and  A  /  B,  and  we  will  write  A  CB. 

Showing  that  a  set  A  is  contained  in  another  set  B  turns  out  to  be  one  of  the  most 
important  tasks  in  mathematics.  One  way  to  show  that  a  set  A  is  contained  in  a  set 
B  is  to  do  exactly  what  the  definition  says;  take  an  arbitrary  element  of  the  set  A 
and  then  show  that  this  element  is  in  set  B.  We  need  not  worry  about  whether  or  not 
there  is  an  element  in  the  set  A.  The  definition  of  subset  requires  only  the  implication 
(x  £  A  implies  x  £  B)  to  be  true,  not  the  antecedent  (x  £  A). 

Example  6.6.  In  Exercise  6.4  we  used  two  sets,  S  and  T,  where  S  =  {2 n  +  1  :  n  £  Z} 
and  T  =  {s2  :  s  £  5}.  Show  that  T  C  S.  Is  T  a  proper  subset  of  S? 

Remember,  to  prove  set  inclusion  we  have  to  take  an  arbitrary  element  in  the  set 
T  and  then  show  that  this  element  is  in  the  set  S.  So,  for  our  proof  of  containment, 
we  will  begin  with  x  £  T,  and  attempt  to  end  our  proof  with  x  £  S. 

As  we  will  see  in  future  chapters,  we  can  often  devise  a  plan  for  a  proof  of  this 
type  by  writing  out  what  we  know  (x  £  T )  at  the  top  of  the  page,  and  what  we  want 
to  show  (x  £  S )  at  the  bottom.  You  have  probably  attempted  proving  things  this  way 
before:  you  work  from  the  top  down,  and  from  the  bottom  up.  So  our  plan  might 
look  like 


x  £  T, 
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But  x  £  T  means  that  x  =  s2  for  some  s  6  S,  and  x  £  S  means  that  x  =  2n  +  1  for 
some  n  £  Z.  So  our  plan  (a  few  minutes  later)  might  look  like 

x£  T, 

x  =  s2,  for  some  s  £  S, 
smaller  space 

x  =  2n  +  1 ,  for  some  n  £  Z, 
x  £  S. 

We  keep  filling  things  in,  making  sure  that  each  line  follows  logically  from  the 
previous  one,  until  we  see  how  to  complete  the  proof.  Here’s  what  we  end  up  with. 

Proof.  (Inclusion)  We  claim  that  T  C  S.  Now,  if  x  £  T,  then  x  =  s2  for  some  s  £  S. 
By  the  definition  of  S ,  there  exists  n  £  Z  such  that  s  =  2n  +  1.  Hence  x  =  s2  = 
(2n  +  l)2  =  4b2  +4b+  1  =  2(2b2  +  2b)  +  1.  Now  let  m  =  2b2  +  2b.  Then  m  £  Z  and 
x  =  2m  +  1 .  Therefore  x  £  S.  Thus  T  C  .S',  as  desired. 

(Proper  subset)  In  fact,  T  is  a  proper  subset  of  S.  To  show  this  we  need  to  exhibit 
an  element  that  is  in  S,  but  not  in  T.  Consider  the  number  —1.  Then  —1  =  2(— 1)  + 1 
and  —  1  £  Z.  Thus,  —1  satisfies  the  defining  property  for  S,  so  —  1  £  S.  On  the  other 
hand,  the  elements  of  T  are  squares  of  real  numbers.  Consequently  all  of  them  are 
nonnegative.  Hence  —  IfT,  and  the  inclusion  is  proper.  □ 

If  you  remember  the  result  of  Problem  3.2,  then  you  know  that  you  already 
showed  that  ,y2  is  odd  if  and  only  if  s  is  odd.  If  you  refer  the  reader  to  this  re¬ 
sult  (carefully  referencing  it,  so  the  reader  can  find  it  easily),  you  can  significantly 
shorten  the  proof  of  inclusion.  Given  two  proofs  written  with  equal  clarity  and  in¬ 
sight,  most  people  will  prefer  the  shorter  of  the  two.  If  the  reader  remembers  the 
result,  reading  it  again  may  detract  from  the  proof.  So,  as  long  as  you  tell  the  reader 
what  you  are  using  and  where  to  find  it  if  he  or  she  needs  to,  you  can  (and  should) 
refer  to  previous  results.  O 

We  now  return  to  the  subject  of  this  chapter.  Notice  that  we  just  told  you  how  to 
show  that  a  set  A  is  contained  in  a  set  B.  All  you  need  to  do  is  show  that  for  all  x, 
if  x  £  A,  then  x  £  B.  So  we  also  just  told  you  how  to  show  that  A  is  not  contained  in 
B — negate  the  definition  of  containment. 

Exercise  6.7.  Negate  the  statement:  “For  all  x,  if  x  £  A,  then  x  £  B.”  Q 

Two  sets  are  equal  if  they  have  precisely  the  same  elements.  This  can  be  defined 
a  bit  more  formally  as  follows.  A  set  A  is  equal  to  B,  written  A  =  B,  if  A  C  B  and 
B  C  A.  To  show  that  two  sets  are  equal  is  therefore  a  two-step  task:  First  you  show 
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that  one  set  is  contained  in  the  other  (AC  B).  Then  you  reverse  the  order  of  the  sets 
and  show  inclusion  again  ( B  C  A). 

Note  that  when  A  is  a  subset  of  B  we  use  the  symbol  C,  but  when  x  is  an  element 
of  A  we  use  the  symbol  G.  Choose  your  symbols  carefully  and  don’t  mix  them  up! 
If  x  is  not  in  A,  then  we  write  x  £  A.  If  A  is  not  a  subset  of  B,  then  we  write  A  %B. 

Exercise  6.8.  Write  a  definition  of  set  equality  that  reverts  back  to  membership  in  a 
set,  rather  than  set  containment.  Q 


Example  6.9.  Show  that  { (x,y)  £  R2  :  x2  -  x  =  y2  =  0}  =  { (0, 0) ,  ( 1 , 0) }. 

Devising  a  plan.  According  to  the  definition  of  equality  above,  we  have  to  show 
two  separate  things.  The  first  is  to  show  that  the  set  on  the  left  is  contained  in  the 
set  on  the  right.  For  this  part  of  the  proof,  we  will  begin  with  an  arbitrary  element 
(w,z)  £  {(x,y)  £  R2  :  x2  —  x  =  y2  =  0}  and  we  will  try  to  show  that  (w,z)  =  (0,0) 
or  (w,z)  =  (1,0).  Then  we  must  show  that  the  set  on  the  right  is  contained  in  the  set 
on  the  left.  So  for  this  part,  we  will  begin  with  (a,b)  =  (0,0)  or  (a,b)  =  (1,0)  and 
try  to  show  that  it  is  in  the  set  on  the  left. 

Before  beginning  any  problem,  make  sure  you  understand  exactly  what  kind  of 
objects  are  under  consideration.  In  this  example,  our  elements  are  in  R2,  and  there¬ 
fore  they  are  “points”  with  an  x-  and  a  y-coordinate.  Thus,  when  we  discuss  ele¬ 
ments,  we  will  not  talk  about  an  element  like  “z,”  but  rather  one  of  the  form  (x,y). 
You’ll  progress  more  quickly  if  you  use  the  particular  form  of  the  elements  in  your 
problem. 

Proof.  If  (w,z)  £  { (x. y)  £  R2  :  x2  —  x  =  y2  =  0},  then  w2  —  w  =  w(w—  1)  =  0  and 
z2  =  0.  Hence  w  =  0  or  w  =  1  and  in  both  cases,  z  =  0.  Thus  (w,z)  =  (0, 0)  or  (w,z) 
=  (1,0).  This  implies  that  (w,z)  £  {(0,0),  (1,0)}  and  therefore  we  can  conclude  that 
{(x,y)  €  R2  :x2-x  =  y2  =  0}  C  {(0,0),  (1,0)}. 

If  (w,z)  £  {(0,0),  (1,0)},  then  w  =  0  or  w  =  1  and  thus  w(w—  1)  =  w2  —  w  =  0. 
We  also  have  z  =  0  and  thus  z 2  =  0.  Hence  (w,z)  £  {(-Cy)  £  R2  :  x2  —  x  =  y2  =  0} 
and  therefore  {(0,0),  (1,0)}  C  {(x,y)  £  R2  :  x2  —  x  =  y2  =  0}. 

By  the  definition  of  equality,  {(x,y)  £  R2  :  x2  —  x  =  y2  =  0}  =  {(0,0),  (1,0)}.  □ 

Exercise  6.10.  Let  A  =  {1,3,5},£  =  {3,4,6},C=  {5},  and  D  =  {1,3}.  Which  sets 
are  subsets  of  the  others?  For  which  sets  S  do  we  have  1  £  S  ’  1  f  S'!  Which  sets  are 
not  subsets  of  each  other?  O 


Theorem  6.11.  Let  A  be  a  set.  Then  0  C  A. 

Proof.  We  must  show  that  for  every  x,  if  x  £  0,  then  x  £  A.  Since  there  are  no 
elements  in  the  empty  set,  the  antecedent  is  always  false.  Therefore  the  implication 
is  always  true,  completing  the  proof.  □ 

Suppose  A  is  a  set  and  we  wish  to  show  that  A  =  0.  One  inclusion,  A  C  0,  will 
suffice  because  the  reverse  inclusion  is  always  true.  In  fact,  such  a  proof  will  often  be 
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done  by  assuming  that  there  exists  x  GA  and  then  obtaining  a  contradiction.  When 
showing  that  a  set  A  =  0,  then,  it  will  look  like  we  are  proving  that  two  sets  are  equal 
by  establishing  only  one  containment.  That’s  only  because  the  other  containment  is 
obvious ! 

Exercise  6.12.  Prove  the  following. 

(a)  For  A  =  {x£R:  x2  +x+  1  =  0},  show  that  A  =  0,  using  the  comments  in  the 
preceding  paragraph. 

(b)  For  B  =  {x  G  N  :  3y  G  R,  x  =  y2},  show  that  B  =  N.  O 

We  will  now  present  a  list  of  very  important  definitions,  using  two  sets,  A  and  B, 
to  create  other  sets.  Some  examples  will  be  presented  (briefly)  here,  and  more  can 
be  found  in  the  exercises.  In  what  follows,  we  assume  that  all  variables  x  belong  to 
a  universe,  X. 

The  union  of  two  sets  A  and  B  is  defined  by  A  UB  =  {x  :  x  G  A  or  x  G  B).  For 
example,  if  A  denotes  the  set  of  even  integers,  and  B  denotes  the  set  of  odd  integers, 
then  AUB  =  Z. 

The  intersection  of  A  and  B  is  A  fl-B  =  {x  :  x  G  A  and  x  G  Bj.  If  A  and  B  are  two 
sets  such  that  AHB  =  0,  then  we  say  that  A  and  B  are  disjoint.  For  example,  if  A  is 
the  set  of  even  integers  and  B  is  the  set  of  odd  integers,  then  A  and  B  are  disjoint. 

The  set  difference  of  B  in  A  is  A \B  =  {x  G  A  :  x  ^  Bj.  A  comment  is  in  order  here. 
We  can  never  look  for  objects  “not  in  B”  unless  we  know  where  to  start  looking.  So 
we  use  A  to  tell  us  where  to  look  for  elements  not  in  B.  If  X  is  the  universe,  we  will 
write  Bc  for  X\B.  This  is  referred  to  as  the  complement  of  B.  For  example,  let  A 
be  the  set  of  integers.  If  B  =  Z+,  then  A\B  is  the  set  of  elements  of  A  (integers)  that 
are  not  in  B  (that  are  not  positive  integers).  Thus  A\fi={xgZ:x<0}.  On  the 
other  hand,  if  A  =  N,  then  A\B=  {0}. 

It  is  possible  to  visualize  these  sets  using  a  representation  called  a  Venn  diagram. 
These  diagrams  are  often  helpful  in  sorting  out  the  relationship  between  sets.  The 
universe  is  usually  indicated  by  a  rectangle  containing  the  sets.  The  idea  is  illustrated 
in  Figures  6.2  and  6.3.  You  might  enjoy  the  pretty  Venn  diagrams  in  [93]  and  the 
book  Cogwheels  of  the  Mind:  The  Story  of  Venn  Diagrams,  [24], 

A  word  of  warning:  Be  careful — pictures  can  be  deceiving.  Use  the  Venn  diagram 
to  get  your  intuition  going,  but  check  everything  carefully  using  the  techniques  we 
have  developed  thus  far. 


Fig.  6.2  A  U B  and  AGB 
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Exercise  6.13.  Use  the  sets  in  Exercise  6.10  to  answer  the  following  questions: 
What  is  A\B ?  A\C?  Which  sets  are  disjoint?  If  the  universe  is  {1,2, 3, 4, 5, 6}, 
what  is  Ac?Find  AUB  and  AHB.  O 


Exercise  6.14.  Write  a  definition  of  union  for  three  sets.  Write  a  definition  of  in¬ 
tersection  for  three  sets.  Can  you  write  a  definition  of  set  difference  for  three  sets? 
Why  or  why  not?  Q 


Definitions 

Definition  6.1.  The  empty  set,  denoted  0,  is  the  set  that  contains  no  element. 

Definition  6.2.  The  set  A  is  a  subset  of  set  B ,  denoted  ACB ,  if  for  all  x  it  is  the  case 
that  x  £  A  implies  that  x  £  B. 

Definition  6.3.  The  set  A  is  a  proper  subset  of  set  B ,  denoted  A  C  B,  if  A  C  B  and 
A^B. 

Definition  6.4.  Two  sets  A  and  B  are  equal  if  ,4  C  B  and  B  C  A. 

Definition  6.5.  The  union  of  the  sets  A  and  B  is  the  set  A  U  ,B  =  {x :  x  £  A  or  x  €  B} . 

Definition  6.6.  The  intersection  of  the  sets  A  and  B  is  the  set  A  (T  B  =  {x :  x  £  A  and 
xGB}. 

Definition  6.7.  Two  sets  A  and  B  are  disjoint  if  4  n  B  =  0. 

Definition  6.8.  The  set  difference  of  set  B  in  set  A  is  the  set  A  \  B  =  {x  £  A  :  x  ^  B} . 

Definition  6.9.  If  the  set  X  is  the  universe  and  A  is  a  subset  of  X,  then  the  comple¬ 
ment  of  A  is  the  set  Ac  =  X\A. 
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Solutions  to  Exercises 

Solution  (6.2).  Here  is  the  answer  to  (b):  The  universe  is  the  set  of  all  real  numbers. 

The  defining  property  is  “rgR\ Q.” 

Solution  (6.3). 

(a)  The  adjective  “pretty”  is  subjective  and  it  is  unclear  whether  a  person  from 
Luxembourg  is  a  member  of  the  set  C  or  not. 

(b)  Consider  the  following  question:  Is  the  collection  D  an  element  of  D  or  not? 
If  it  is  an  element  of  D,  then  it  must  satisfy  the  defining  property,  which  says 
that  D  is  not  an  element  of  D\  in  other  words,  in  this  case  it  would  have  to  be 
both  in  the  set  and  not  in  the  set.  On  the  other  hand,  if  D  is  not  an  element 
of  the  collection  D,  then  it  does  just  what  the  defining  property  says.  Thus  it 
must  be  in  the  set  Z);  in  other  words,  in  this  case  it  would  have  to  be  both  in 
the  set  and  not  in  the  set.  Hence,  this  property  is  contradictory. 

Solution  (6.4).  We  can  write  T  =  {x  £  Z  :  x  =  (2 n  +  l)2  for  some  n  £  Z}  and  S 

=  {2 n  +  1  :  n  £  Z}. 

Solution  (6.5).  Let  A  be  the  set  of  nonzero  integers. 

(a)  A  =  {x£  Z  :x^0}. 

(b)  Let  a-  andv  be  elements  of  A.  Then  x*y  =  2xy.  Since  x.y,  and2  are  all  integers, 
x*y  £  Z.  Furthermore,  since  x  and  y  are  elements  of  A,  they  are  nonzero. 
Therefore  x*y  =  2 xy  ^  0.  Consequently  x*y  £  A,  as  desired. 

(c)  This  is  false.  Suppose  to  the  contrary  that  there  were  such  an  element  y  in 
A.  Then  x*y  =  x  for  every  x  £  A.  Choosing  x  =  1,  we  see  that  1  =  1  *y 
=  2(l)(y)  =  2y.  The  only  solution  to  this  equation  is  y  =  1  /2,  which  is  not  an 
integer  and  therefore  not  an  element  of  A.  This  contradiction  shows  that  no 
such  y  can  exist. 

Solution  (6.7).  The  negation  is  “There  exists  an  x  such  that  x£A  and  x  £  BT 

Solution  (6.8).  Two  sets  A  and  B  are  equal  if  for  all  x  we  have  x  £  A  if  and  only  if 

x  £  B. 

Solution  (6.10).  The  following  statements  hold: 

CCA  and  OCA; 

no  other  sets  are  subsets  of  each  other; 

1  €  A,  1  £  D,  1  (£  B,  and  1  £  C. 

Solution  (6.12). 

(a)  Suppose  there  exists  x  £  A.  Then  r€R  and  x2  +x+l  =0.  This  implies  that 
x  =  (— 1  +  \/3i) /2  or  x  =  (—  1  —  V^i) / 2.  In  both  cases,  x  K  leading  to  a 
contradiction.  Hence  A  =  0.  Alternatively,  we  might  recall  the  technique  of 
completing  the  square  to  note  that 
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establishing  a  contradiction. 

(b)  We  need  only  show  that  NC  B  (why?).  If  n  £  N,  then  n  £  K.  and  n  >  0.  Thus 
y  =  \Jn  £  M  and  n=y2.  Hence  y  £  B.  This  shows  that  N  =  B. 


Solution  (6.13).  The  following  statements  hold: 

A\£=  {1, 5},  A\C=  {1,3}; 

sets  B  and  C  are  disjoint,  and  sets  C  and  D  are  disjoint; 
if  the  universe  is  as  given,  then  Ac  =  {2,4,6}; 

AUfi  =  {1,3, 4, 5, 6},  and  A n,B  =  {3}. 


Solution  (6.14).  Let  A .  B,  and  C  be  sets  and  let  the  universe  be  denoted  by  X. 
Then  A  U  B  U  C  =  {x  £  X  :  x  £  A  or  x  £  B  or  x  £  C}  and  A(T.BnC={xeY  : 
x  £  A  and  x  £  B  and  x  £  C}.  While  the  union  and  intersection  of  three  sets  makes 
sense,  the  set  difference  of  three  sets  does  not.  In  order  to  answer  this  question,  we 
would  need  to  reduce  it  to  a  set  difference  of  two  sets  by  including  parentheses.  For 
example,  you  can  define  the  following  set  differences:  ( A\B)\C  and  A\(B\C)  (try 
it!).  Work  out  what  these  last  two  sets  are  when  A,  B,  and  C  are  as  in  Exercise  6.10. 


Spotlight:  Paradoxes 

You  may  already  have  seen  paradoxes  in  mathematics.  For  example,  you  may  have 
seen  Zeno’s  paradoxes,  the  most  famous  of  which  is  the  story  of  Achilles  and  the 
tortoise,  in  your  calculus  class.  Another  well-known  paradox  comes  from  the  fol¬ 
lowing:  what  is  the  sum  of 


1-1  + 1-1  +  1 - 7 

You  might  argue  that  this  sum  should  be(l  —  1)  +  (1  —  1)4 - =  0.  Or,  you  might 

just  as  well  argue  that  this  sum  should  be  1  +  (— 1  +  1)  +  (—1  +  1)  H - =  1.  You 

might  even  argue  (as  Luigi  Guido  Grandi  did  [27,  p.  135])  that  since  the  sums  0  and 
1  are  equally  probable,  the  answer  should  be  the  average  of  0  and  1 ;  in  other  words, 
1  / 2.  This  paradox  forces  us  to  look  closely  at  exactly  what  we  mean  by  summing 
infinitely  many  numbers. 

Betrand  Russell  pointed  out  a  paradox  in  set  theory.  He  also  presented  a  popu¬ 
lar  form  of  this  paradox,  called  the  barber  problem.  The  problem  is  the  following. 
Suppose  there  is  a  town  with  one  barber,  and  this  barber  says  that  he  shaves  those 
people,  and  only  those,  who  do  not  shave  themselves.  The  question  is:  Who  shaves 
the  barber?  (You’ll  recognize  the  set  theoretic  form  of  this  problem  in  Exercise  6.3.) 

Paradoxes  serve  a  very  useful  purpose.  They  point  out  where  the  foundations  of 
mathematics  are  shaky  (or  even  faulty!).  To  learn  more  about  them,  and  how  they 
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have  been  handled,  we  recommend  reading  [26,  Chapter  15],  [57,  Chapter  18],  or 
[59,  Chapter  51], 


Problems 

Problem  6.1.  “Order”  the  following  sets,  using  the  appropriate  symbol  from  among 

=,C,C: 

N,R,Z+,Z,Q. 

Problem*  6.2.  Let  A .  B,  and  C  be  sets.  Prove  that  if  AC  B  and  B  C  C,  then  ACC. 
(We  say  that  set  containment  is  transitive.) 

Problem  6.3.  Recall  that  N  denotes  the  set  of  natural  numbers,  Z  the  set  of  integers, 
and  R  the  set  of  real  numbers. 

(a)  Write  the  phrase  “x  belongs  to  R”  in  symbols. 

(b)  Write  the  phrase  “Z  is  a  proper  subset  of  R”  in  symbols. 

(c)  Write  the  phrase  “If  x  is  an  element  of  Z,  then  x  or  —  x  is  an  element  of  N”  in 
symbols. 

(d)  Use  set  notation  to  describe  the  set  of  squares  of  all  multiples  of  3. 

Problem  6.4.  In  this  problem  our  universe  is  R,  the  set  of  real  numbers. 

(a)  Give  an  example  of  subsets  A  and  B  of  R  that  are  disjoint. 

(b)  Give  an  example  of  subsets  A  and  B  of  R  that  are  not  disjoint  and  find  A\B 
and  B\A. 

(c)  Give  an  example  of  subsets  A  and  B  of  R  such  that  ACB. 

(d)  Give  an  example  of  subsets  A ,  B,  and  C  of  R  such  that 

AU(BflC)  /  (AUB)U(AUC). 

Problem  6.5.  The  universe  in  this  problem  is  R.  Let  A  be  the  closed  interval  [0,2] 
and  let  B  be  the  closed  interval  [—1,1].  Find  A\B,  B\A,  Ac,  Bc,  AcC\Bc,  (AUB), 
and  (AUB)C. 

Problem  6.6.  Find  an  expression  for  each  of  the  shaded  sets  in  the  Venn  diagrams 
of  Figure  6.4. 

Problem  6.7.  (a)  Consider  the  set  S  of  nonzero  real  numbers.  Write  S  in  set  no¬ 

tation. 

(b)  Define  a  new  “multiplication”  on  this  set  by  xVy  =  x/y.  If  x,y  £  S,  is  xCy  £  SI 
Is  there  an  element  y  £  S  such  that  xCy  =  x  for  all  x  £  SI 

(c)  Repeat  parts  (a)  and  (b),  replacing  the  set  S  by  the  set  T  of  negative  real 
numbers. 

(d)  Repeat  parts  (a)  and  (b),  replacing  the  set  S  by  the  set  V  of  nonzero  rational 
numbers. 


6  Sets 


69 


Problem  6.8.  Define  two  sets  A  and  B  as  follows:  A  =  {(2 n  +  l)3  :  n  £  Z}  and 
B  =  {2 n  +  1  :  n  £  Z}. 

(a)  Prove  that  A  C  B. 

(b)  Suppose  we  redefine  A  and  B,  replacing  Z  by  R;  in  other  words,  let  A  = 
{(2 n  +  l)3  :  n  £  R}  and  B  =  {2 n  +  1  :  n  £  R}.  What  is  the  relation  between 
these  two  sets?  State  and  prove  your  answer. 

Problem  6.9.  Find  an  expression  for  each  of  the  shaded  sets  in  the  Venn  diagrams 
of  Figure  6.5. 

Problem  6.10.  Is  the  following  statement  true  or  false:  {0}  =  0?  Why? 

Problem  6.11.  Show  that  {x  G  R  :  x2  —  1  =  0}  =  { 1 ,  —  1 }. 

Problem  6.12.  Let  A  =  {x  £  Z  :  6  divides  x},  B  =  {x  £  Z  :  21  divides  x},  and  C  = 
{x  £  Z  :  42  divides  x} .  Prove  that  AHB  =  C. 

Problem  6.13.  Let  A  =  {(x,y)  £  R2  :x  —  y  =  0},  B  =  {(x,y)  £  R2  :  x+y  =  0},  and 
C  =  {(x,y)  £  R2  :  x2  —  y2  =  0}.  Prove  thatAUfi  =  C. 

Problem  6.14.  Let  A  =  Z,  B  =  {x  £  Z  :  x  =  2n  +  5  for  some  n  £  Z}  and,  C  =  {x  £  Z : 
x  =  —2m  for  some  m  £  Z}.  Prove  that  A\B  =  C. 

Problem  6.15.  Let  S  be  the  set  of  nonzero  real  numbers.  Define  a  new  “addition” 
on  this  set  by  x  jj  y  =  x  +  y  +  1.  If  you  add  two  numbers  in  .S',  do  you  end  up  with  a 
number  in  SI  (In  other  words,  if  x,y  £  S,  is  x  (I  y  £  SI) 

Problem  6.16.  Prove  that  A  =  B  in  each  of  the  following. 

(a)  Let  A  and  B  be  the  sets  defined  by  A  =  {x  £  R  :  sin(7Tx)  =  0}  and  B  =  Z. 

(b)  Letx  £  R.  Define  A  =  {(ax  +  b)/(cx  +  d)  :  a,b,c,d  £  Z  and  cx  +  d  0}  and 
B  =  {(px  +  q)/{rx  +  s)  :  p}q,r,s  £  Q  and  rx  +  s  ^  0}. 

Problem  6.17.  Let  A  =  {x  £  R  :  ax2  +  bx  +  c  =  0  for  some  integers  a,  b,  and  c,  with 
at  least  one  of  a,  b,c  nonzero}  and  let  B  =  {x  G  R  :  px2  +qx+  r  =  0  for  some  rational 
numbers  p,  q,  and  r,  with  at  least  one  of  p.  q ,  r  nonzero}. 

(a)  Prove  that  2  £  A. 


Fig.  6.4 
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Fig.  6.5 


(b)  Prove  that  \fl  £  A. 

(c)  Give  an  example  of  a  real  number  y  such  that  y  ^  A.  (You  do  not  need  to  prove 
thaty  ^  A.) 

(d)  Prove  that  A=B. 

(e)  Prove  that  QCA. 

The  following  problems  deal  with  sets  of  points  in  the  plane.  We  remind  you  of 
the  notation  introduced  in  Chapter  4.  The  set  of  all  points  in  the  plane  is  denoted 
by  M2  =  {(rj):rjel}.  Subsets  of  M2  will  be  studied  in  more  generality  in 
Chapter  9. 

Problem  6.18.  Define  a  set  A  by  A  =  { (x,  y)  £  R2  :  y  ^  0} . 

(a)  Give  a  geometric  description  of  A. 

(b)  Suppose  we  tell  you  that  if  you  have  two  elements  of  this  set  A,  you  can  “add” 
them  according  to  the  following  rule: 

(*,  y)  o  (z,  w)  =  (xw  +  zy,  wy ) . 

The  symbol  +  here  denotes  usual  addition.  Show  that  the  object  that  results 
when  we  add  two  elements  of  our  set  A  is  again  an  object  in  our  set  A. 
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(c)  Continuing,  find  an  element  ( a,b )  in  A  such  that  (a,b)  o  (x,y)  =  (x,y)  for 
every  (x,y)  in  A. 

(d)  This  “new”  addition  probably  looks  somewhat  odd  to  you,  but  you  have  seen 
it  before.  What  is  it? 

Problem  6.19.  In  each  part  of  this  problem,  two  sets,  A  and  B ,  are  defined.  Prove 
that  A  C  B  in  each  of  the  following: 

(a)  A  =  {x2  :x  G  Zj  and  B  =  Z; 

(b)  A  =  R  and  B  =  {2x  :  x  G  R}; 

(c)  A  =  {(x,y)  €  R2  :y=  (5  —  3x)/2}  and  B  =  {(x,y)  eR2  :  2y  +  3x  =  5}. 
Problem  6.20.  Prove  that  {n2  +n  +  1  :  n  G  N}  C  {2 n  +1  :  n  G  N}. 

Problem  6.21.  Let 

A  =  {x  G  N  :  x  <  x2  and  for  all  y  G  Z,  if  y\x,  then  y2\x}. 

Prove  that  A  =  0. 

Problem  6.22.  Prove  that  one  set  is  a  proper  subset  of  the  other  one  in  each  of  the 
following: 

(a)  A  =  {(x,y)  G  K2  :  xy  >  0}  and  B  =  {(x,y)  G  R2  :  x2+y2  >  0}; 

(b)  A  =  0  and  B  =  {(x,y)  G  R2  :  x2  +y2  <  0}. 

Problem  6.23.  Consider  the  sets 

A  =  {(x,y)  G  R2  :  x2+y2  <  1}  and  B  =  {(x,y)  G  R2  :  |x|  +  |y|  <  1}. 

Is  one  of  the  two  sets  properly  contained  in  the  other  one?  Justify  your  answer. 
Problem  6.24.  Define  the  following  three  sets: 

A  =  {(x,y)  G  R2  :  xy  >  0}; 
fi  =  {(x,y)  £  R2  :  y  >  |x|}; 

C  =  {(x,y)  £  R2  :  0  <  x  <  y}. 


Carefully  prove  that  AC\B  =  C. 


Chapter  7 

Operations  on  Sets 


By  an  operation  on  sets  we  mean  the  construction  of  a  new  set  from  the  given  ones. 
As  we  saw  in  the  last  chapter,  these  new  sets  may  be  formed  using  unions,  in¬ 
tersections,  set  differences,  or  complements  of  given  sets.  In  this  section,  we  will 
look  at  many  important  properties  of  operations  on  sets.  We  end  the  chapter  with  a 
summarizing  list  of  identities.  In  the  exercises  and  problems  you  will  be  given  the 
opportunity  to  prove  the  most  important  ones  and  then  commit  them  to  memory,  so 
you  don’t  have  to  re -prove  them  every  time  you  need  them. 

The  Venn  diagrams  introduced  in  the  previous  chapter  can  be  helpful  in  deciding 
what  is  true  and  what  is  false,  and  they  can  be  part  of  understanding  the  problem.  If 
the  arrangement  of  your  sets  in  the  diagram  is  a  so-called  “typical”  one  (see  [103]), 
it  is  even  possible  to  use  the  Venn  diagram  as  a  proof.  However,  in  this  text,  we 
(and  you)  will  use  Venn  diagrams  to  guide  us,  but  we  will  write  our  proofs  without 
relying  on  them.  When  you  prove  these  properties  you  may  not  always  need  to  start 
from  the  definition.  Sometimes  you  can  use  what  you  know,  and  once  you  have 
proven  everything  in  Theorem  7.4,  you  will  know  a  lot. 

The  first  theorem  is  a  good  example  of  a  proof  in  cases.  It  keeps  things  tidy.  Now 
remember,  if  we  use  the  definition  to  show  two  sets  A  and  B  are  equal,  then  we  must 
show  that  if  x  £  A,  then  x  £  B  and  if  x  £  B,  then  x  £  A. 

Theorem  7.1  (The  distributive  property).  Let  A, B,  and  C  be  sets.  Then 

AU(BnC)  =  (AUB)n(AUC). 

Before  reading  the  proof,  let’s  use  Polya’s  method. 

“Understanding  the  problem Draw  two  Venn  diagrams  representing  the  left  and 
right  sides  of  the  equality  above.  Each  diagram  will  have  three  sets,  appropriately 
labeled  A,B ,  and  C.  Shade  in  the  area  described  by  the  left  side  of  the  equation  in 
one  diagram  and  then  shade  the  right  side  in  the  other  diagram.  They  should  look 
the  same.  While  this  should  convince  you  that  you  are  on  the  right  track,  it  is  not 
enough  to  convince  someone  else. 

“Devising  a  plan.”  We  wish  to  show  that  two  sets  are  equal.  Using  the  definition 
of  equality  of  sets,  we  know  that  we  must  show  two  things.  The  first  thing  to  show 

U.  Daepp  and  P.  Gorkin,  Reading,  Writing,  and  Proving:  A  Closer  Look  at  Mathematics,  73 

Undergraduate  Texts  in  Mathematics,  DOI  10.1007/978-1-4419-9479-0  7, 

©  Springer  Science+Business  Media,  LLC  20 1 1 


74 


7  Operations  on  Sets 


is  that  A  U  (fiflC)  C  (AUB)n(AUC).  So  our  first  line  will  begin 

IfxGAU(finC), 

and  our  last  line  (for  this  part  of  the  proof)  will  look  like 

Thus  x  G  (AUB)n(AUC). 

Now  we  just  have  to  figure  out  how  to  get  from  the  first  line  to  the  last  one.  Let’s  fill 
in  some  things,  making  sure  that  each  line  follows  logically  from  the  previous  one. 
Working  down  from  the  top  we  get 

x  g  au  (fine), 
xgAoriSfinC, 

and  working  up  from  the  bottom  leads  to 

rgAUfi  and  x  G  A  U  C, 
xG  (AUfi)n(AUC). 

Looking  at  what  we  are  missing  in  our  proof  suggests  that  we  use  a  proof  in  cases; 
one  that  depends  on  whether  rgAorxgflnC. 

Once  we  are  done  with  the  proof  above,  we  must  show  that  (AUfi)n(AU C) 
C  AU(finC).  We  use  the  same  method  to  devise  our  plan  for  a  proof  of  this  set 
containment:  We  write  down  our  first  line  and  look  to  see  where  it  takes  us.  Then 
we’ll  write  down  our  last  line  and  try  to  figure  out  how  to  get  there.  That  leads  to 

x  G  (AUfi)n(AUC), 

x  G  A  U  fi  and  x  G  A  U  C, 

[stuff] 

x  G  A  or  x  G  fi  n  C, 
x  g  Au  (fine). 

It  looks  like  if  x  G  A,  we  have  our  proof.  But  what  if  x  ^  A?  This  again  suggests  a 
proof  in  cases;  one  that  depends  on  whether  x  G  A  or  x  ^  A.  If  you  see  what  to  do 
now,  you  can  write  up  the  proof.  If  you  still  do  not  see  what  to  do,  continue  using 
this  method  until  you  see  the  solution. 

Once  you  see  the  solution,  fill  in  the  missing  steps  and  write  the  proof  up  care¬ 
fully  using  complete  sentences,  as  we  do  below. 

Proof.  If  x  G  A  U  (fi  n  C),  then  xGA  or  x  G  fi  n  C.  Suppose  first  that  x  G  A.  Then 
x  G  A  U  fi  and  x  G  A  U  C.  In  this  first  case,  we  see  that  x  G  (AUfi)n(AUC).  Now 
suppose  that  xGfinC.  Then  x  G  fi  and  x  G  C.  Since  x  G  fi,  we  see  that  x  G  A  U  fi. 
Since  we  also  have  x  G  C,  we  see  that  x  G  A  U  C.  Therefore,  x  G  (AUfi)n(AUC) 
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in  this  case  as  well.  In  both  cases  x  £  (A  UB)  D  (A  UC)  and  we  may  conclude  that 

Au(Bnc)  c  (AuB)n(Auc). 

To  complete  the  proof,  we  must  now  show  that  (AUB)fl(AUC)  CAU(flnC). 
So  if  x  £  (AUB)n(AUC),  then  xSAUB  and  x  £  A  U  C.  It  is,  once  again,  helpful 
to  break  this  into  two  cases,  since  we  know  that  x  £  A  or  x  ^  A.  Now  if  x  £  A,  then 
x  £  AU  (BflC).  If  x  £  A,  then  the  fact  that  x  £  A  UB  implies  that  x  must  be  in  B. 
Similarly,  the  fact  that  x  £  A U C  implies  that  x  must  be  in  C.  Therefore,  x  £  B  D  C. 
Hence  x  £  A  U  (B  n  C).  In  both  cases  x  £  A  U  (B  n  C)  and  we  may  conclude  that 

(Aus)n(Auc)  cAu(enc). 

Since  we  proved  containment  in  both  directions  we  may  conclude  that  the  two 
sets  are  equal.  □ 

Look  at  the  proof  above.  It  has  complete  sentences,  variables  are  identified,  we 
know  when  we  are  in  one  case  and  then  the  other,  and  we  know  when  the  proof  is 
complete.  You  should  use  the  form  as  a  model,  but  remember  that  each  proof  will 
be  unique. 

The  following  theorem  involves  an  “if  and  only  if”  statement.  Remember  that  an 
“if  and  only  if”  statement  requires  that  you  prove  both  the  “if”  and  the  “only  if.” 

Theorem  7.2.  Let  A  and  B  be  sets.  Then  AUB  =  A  if  and  only  ifB  C  A. 

Proof.  First  we’ll  show  that  if  AUB  =  A,  then  B  C  A.  So  assume  AUB  =  A.  Ifx  £  B, 
then  x  £  AUB.  Using  the  assumption  that  A  L  B  =  A  we  have  x  £  A.  This  shows  that 
BCA. 

Now  we  will  prove  that  if  BCA,  then  AUB  =A.  So  let  us  assume  that  BCA. 
We  must  show  that  AUB  C  A  and  A  CAUB.  To  prove  the  first  containment,  we 
have  that  if  x  £  AUB,  then  x  £  A  or  x  £  B.  If  x  £  A,  then  x  is  where  it  needs  to  be 
and  we  have  nothing  more  to  prove.  If  x  £  B,  then  we  use  the  assumption  that  BCA 
to  conclude  that  x  £  A.  In  both  cases  we  get  x  £  A  and  therefore  have  A  U  BCA. 
To  prove  the  second  containment,  let  x  £  A.  Then  x  £  AUB  and  we  conclude  that 
A  CAUB.  Together  we  have  proven  that  A  U  B  =  A.  □ 

The  stmeture  of  the  proof  of  Theorem  7.2  is  more  complicated  than  the  proof 
of  the  distributive  property.  First,  as  we  said  above,  there  are  two  things  to  prove: 
the  “if”  and  the  “only  if.”  Next,  both  of  these  statements  have  hypotheses  and  con¬ 
clusions.  In  each  case,  you  must  be  aware  of  what  you  are  assuming  and  what  you 
are  proving.  What’s  even  more  important,  though,  is  that  you  use  what  you  are  as¬ 
suming  to  get  to  your  desired  conclusion.  If  you  don’t  use  your  assumption,  then 
your  original  statement  was  poorly  constructed,  you  proved  more  than  you  thought 
you  did,  or  your  proof  was  in  error.  In  fact,  in  the  proof  above,  we  did  not  use  our 
assumption  that  B  C  A  to  prove  A  CAUB.  Did  we  make  an  error,  or  did  we  prove 
more  than  we  said  we  did? 

Now  that  you  have  seen  two  examples  of  how  to  write  such  a  proof,  it  is  time  for 
you  to  try  it  by  yourself.  Try  proving  one  of  the  two  DeMorgan’s  laws  below. 

Exercise  7.3.  Let  A  and  B  be  subsets  of  the  set  X.  Then 
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X\(AUB)  =  (X\A)n(X\B). 


(a)  Devise  your  plan.  (Include  a  Venn  diagram.) 

(b)  Write  up  your  proof.  Q 

We  now  give  the  promised  list  of  some  of  the  properties  of  set  operations.  We 
proved  three  of  them  above.  In  the  problems  you  will  be  asked  to  work  more  of  the 
proofs. 

Theorem  7.4.  Let  X  denote  a  set,  and  A,B,  and  C  denote  subsets  ofX.  Then 

1.  0C  A  and  AC  A. 

2.  (. Acf=A . 

3.  A  U  0  =  A. 

4.  AH0  =  0. 

5.  AHA  =  A. 

6.  AUA  =  A. 

7.  AnB  =  BnA.  ( Commutative  property) 

8.  AUB  =  BU  A.  ( Commutative  property) 

9.  (AUB)UC  =  AU  (BUC).  (Associative property ) 

10.  (An B)  DC  =  An  {BUC).  (Associative property) 

11.  AUB  C  A. 

12.  AC  AUB. 

13.  AU(BnC)  =  (AUB)n(AUC).  ( Distributive  property) 

14.  An(BUC)  =  (AnB)U(AnC).  ( Distributive  property) 

15.  X\{AUB)  =  {X\A)U{X\B).  (DeMorgan’s  law) 

(When  X  is  the  universe  we  also  write  (A  UB)C  =  Ac  UBC.) 

16.  X\{AUB)  =  (X  \  A)  U  (X\B).  (DeMorgan’s  law) 

(When  X  is  the  universe  we  also  write  (A  nB)e  =  ACUBC.) 

17.  A\B=AUBC. 

18.  ACB  if  and  only  if  (X\B)C(X\A). 

(When  X  is  the  universe  we  also  write  ACB  if  and  only  ifBc  C  Ac.) 

19.  ACC  and  B  CC  if  and  only  if  A  U  B  C  C. 

20.  CCA  and  C  C  B  if  and  only  if  C  CAC\B. 

21.  AUB  =  A  if  and  only  if  B  C  A. 

22.  AUB  =  B  if  and  only  ifB  C  A. 

Many  results  can  be  proved  using  the  methods  demonstrated  thus  far  in  this  chap¬ 
ter.  Once  you  have  proven  these  statements,  though,  it  is  a  good  idea  to  use  them  in 
other  proofs.  Practice  using  the  results  in  Theorem  7.4  in  the  next  exercise. 

Exercise  7.5.  Let  A,B,  and  C  be  sets.  Prove  the  following  using  relevant  statements 
from  Theorem  7.4:  If  Cc  C  B.  then  (A\B)UC  =  C.  Q 
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Definition 

Definition  7.1  (for  Problems  7.6  through  7.10).  The  symmetric  difference  of  sets 
A  and  B  is  the  set  AAB  =  (A\B)U  ( B\A ). 


Solutions  to  Exercises 

Solution  (7.3).  First  we  show  that 

X\{AUB)  C  (X\A)n(X\,B). 

If  a-  £  X\  (AUB),  then  x  (/-_  A  U  B.  Therefore  x  A  and  x  f/  B.  Consequently,  x  £  X\A 
and  x  £  X  \  B.  Thus  x  £  (X  \  A)  IT  (X  \  B) .  We  conclude  that  X  \  (A  U  Z?)  C  (X  \  A) 

n(x\B). 

We  now  show  that 


(X\A)n(X\B)CX\(AUB) 


If  x  £  (X  \A)  tl  (X  \B),  then  x  £  X  \  A  and  x  £  X  \ B.  Thus,  x  £  X  and  x  ^  A,  and 
x  £  X  and  x^B.  So,  x  £  X  and  x  $  A  and  x  B.  This  implies  that  x  £  X  and  x^AUB. 
Therefore,  x  £  X\  (AUB),  and  we  see  that  (X\A)  (T  (X\B)  CX\(AUB).  Thus,  the 
two  sets  are  equal. 

Solution  (7.5).  Since  Cc  C  B,  statements  18  and  2  of  Theorem  7.4  imply  Bc  C  C, 
and  thus  Bc  UC  =  C  by  statement  19  of  the  same  theorem.  The  rest  of  the  proof 
now  follows  from  the  following  string  of  equalities  (numbers  indicate  the  relevant 
statements  from  Theorem  7.4): 


(A  \  Z?)  U C  =  (A  fl Be)  U C  (by  17) 

=  (A  U  C)  H  ( Bc  U  C)  (by  8  and  13) 


(Auc)nc 

c 


(since  BCUC  =  C  as  shown) 
(by  8,  12,  and  20). 


Problems 

In  all  the  problems  below,  X  denotes  a  set;  A,B,  and  C  denote  subsets  ofX. 
Problem  7.1.  In  this  problem  we  refer  to  statements  of  Theorem  7.4. 

(a)  Prove  statement  2. 

(b)  Prove  statement  14. 

(c)  Prove  statement  16. 

(d)  Prove  statement  18. 
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(e)  Prove  statement  20. 

(f)  Prove  statement  22. 
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Problem  7.2.  Prove  that  A  (IB  =  0  if  and  only  if  B  C  (X  \  A). 

Problem  7.3.  Prove  that  A  =  B  if  and  only  if  ( X \A)  =  (X\B).  Make  sure  you  use 
statements  from  Theorem  7.4  rather  than  going  back  to  the  definition. 

Problem  7.4.  Prove  the  following  using  the  results  stated  in  Theorem  7.4: 

(a)  (AUB)nB  =  B\ 

(b)  (AnB)UB  =  B. 

Problem  7.5.  Show  that  (A\B)j(BnAc)  =  (A  U  B)  \  (B DA )  in  two  differents  ways, 
by  completing  the  two  parts  below. 

(a)  Prove  this  equality  using  only  the  definition  of  set  containment. 

(b)  Using  theorems  from  the  text,  give  a  different  proof  of  the  equality  above.  If 
you  use  a  result  that  has  a  name,  state  the  name  of  the  result. 

Problem  7.6.  Draw  the  Venn  diagram  for  the  symmetric  difference  A  AB  of  two  sets 
A  and  B  and  prove  that 

AAB  =  (AUB)  \  (APB). 

Problem  7.7.  For  sets  A  and  B  prove  the  following. 

(a)  AAA  =  0; 

(b)  AA0  =  A; 

(c)  AAB  =  BAA.  (The  symmetric  set  difference  is  commutative.) 

Problem  7.8.  Prove  that  for  sets  A,  B,  and  C,  we  have  (AAB) AC  =  AA(BAC). 
(The  symmetric  set  difference  is  associative.) 

Problem  7.9.  Prove  that  for  sets  A  and  B,  we  have  AAB  =  A\B  if  and  only  if  B  C  A. 

Problem  7.10.  Prove  that  for  sets  A  and  B,  we  have  (A  U  B)A(A  P  B)  =  AAB. 

Problem  7.11.  Sketch  Venn  diagrams  of  the  set  on  the  left  and  the  set  on  the  right 
side  of  the  equation 


(A\(flnc))u(fl\c)  =  (Aufi)\(snc). 

Once  you  have  done  that,  prove  that  the  equality  above  holds. 
Problem  7.12.  Consider  the  following  sets: 


(i)  A\  (AUBUC), 

(ii)  A\ACflnC, 

(iii)  AnBc  ncc, 

(iv)  A\(BUC),and 

(v)  (A  \  z?)  n  (A  \  c) . 
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Fig.  7.1 

(a)  Which  of  the  sets  above  are  written  ambiguously,  if  any? 

(b)  Of  the  ones  that  make  sense,  which  of  the  sets  above  agree  with  the  shaded 
set  in  Figure  7.1? 

(c)  Prove  that  A  \  (B  U  C)  =  (A  \  B)  D  (A  \  C) . 

Problem  7.13.  Consider  the  following  sets: 

(i)  (Anfi)\(AnBflC), 

(ii)  Anfi\(Anflnc), 

(iii)  Anenc, 

(iv)  (Anfi)\C,  and 

(v)  (A\C)n(B\C). 

(a)  Which  of  the  sets  above  are  written  ambiguously,  if  any? 

(b)  Of  the  sets  above  that  make  sense,  which  ones  equal  the  set  sketched  in  Fig¬ 
ure  7.2? 

(c)  Prove  that  (A  D  B)  \  C  =  (A  \  C)  O  (B  \  C) . 

Problem#  7.14.  In  this  problem  you  will  prove  that  the  union  of  two  sets  can  be 
rewritten  as  the  union  of  two  disjoint  sets. 

(a)  Prove  that  the  two  sets  A  \  B  and  B  are  disjoint. 

(b)  Prove  that  AUB  =  (A\B)UB. 

Problem  7.15.  Prove  that  Ac  U  Bc  =  X  if  and  only  if  A  and  B  are  disjoint. 


Fig.  7.2 
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Problem  7.16.  Prove  that  (A  \  B)  U  (B  \  C)  C  A  \  C  if  and  only  ifADCCAflB  and 
B<1CCCA. 

Problem  7.17.  Prove  or  disprove:  If  A  U  B  =  A  U  C,  then  B  =  C. 

Problem  7.18.  Prove  or  give  a  counterexample  for  the  following  statement. 

Let  X  be  the  universe  and  A,  C  X.  If  A  fl  L  =  B  D  Y  for  all  Y  CX,  then  A  =  B. 

Problem  7.19.  Prove  that  A  fl  (Bc  n  Cc)  =  0  if  and  only  if  ,4  C  BUC. 

Problem  7.20.  Prove  that  (A  U  B)  \  (C  U  D)  =  (A  \  (C  U  D) )  U  (B  \  (C  U  D) ) . 


Chapter  8 

More  on  Operations  on  Sets 


Most  of  what  we  did  in  the  last  two  chapters  was  concerned  with  operations  on  two 
sets.  In  Exercise  6.14  we  defined  unions  and  intersections  of  three  sets.  In  general, 
we  may  have  two  or  three  sets,  as  many  sets  as  there  are  integers,  or  even  more  sets 
than  that.  We’ll  need  a  new  definition  and  special  notation.  In  this  chapter,  we  will 
introduce  the  notation  that  will  allow  us  to  keep  track  of  these  sets.  Unfortunately,  a 
rigorous  definition  will  have  to  wait  until  Chapter  14. 

Let  n  be  a  positive  integer  and  suppose  that  we  have  sets  ,4 1  ./U, . . .  ,A„.  How  can 
we  talk  about  the  union  of  these  n  sets?  the  intersection?  For  example,  when  we 
have  three  sets,  if  we  write  |j]=i  Ay  =  A  i  UA2  UA3,  we  would  be  referring  to  the  set 
of  x  in  our  universe  that  lie  in  at  least  one  of  our  sets,  A\,  A2,  or  A3.  Of  course,  there 
is  nothing  special  about  three  sets;  that  is,  for  every  positive  integer,  n,  we  can  write 

n  n 

U  Aj  =AiUA2U---UA„  and  Q  Aj  =A  1  nA2n---OA„. 
j=  1  J=  1 

The  first  set  would  be  the  set  of  all  x  in  the  universe  that  lie  in  at  least  one  of  the  A ; 
for  j  =  1,2, ...  ,n,  while  the  second  would  be  the  set  of  x  that  lie  in  all  of  the  sets 
Aj.  If  we  have  a  set  A;-  for  each  positive  integer  j  and  we  want  to  take  the  union  and 
intersection  over  all  positive  integers,  then  we  write 

(J  Aj  =Aj  UA2  U  •  •  •  and  Q  Aj  =  Aj  OA2  (T  •  •  •  . 

j= l  j= 1 

This  is  probably  a  good  time  to  look  at  some  examples. 

Example  8.1.  We  can  write  the  union  in  different  ways.  For  example, 

10 

U  [0, 7]  =  [0, 1]  U  [0,2]  U  •  -  -  U  [0, 10]  =  [0, 10]. 
j=  1 


Similarly, 


U.  Daepp  and  P.  Gorkin,  Reading,  Writing,  and  Proving:  A  Closer  Look  at  Mathematics , 
Undergraduate  Texts  in  Mathematics,  DOI  10. 1007/978-1 -44 19-9479-0  8, 

©  Springer  Science+Business  Media,  LLC  20 1 1 


81 


82 


8  More  on  Operations  on  Sets 


10 

f|  [0.7]  =  [0, 1]  n  [0,2]  n-n  [o,io]  =  [o,i], 

j=  i  u 

In  Example  8.1,  we  had  unions  and  intersections  of  finitely  many  sets  (ten,  to 
be  precise).  We  now  take  a  look  at  what  can  happen  when  we  take  unions  and 
intersections  of  even  more  sets. 

Example  8.2.  (a)  For  each  q  G  Z+  define  the  set  Aq  =  {p / q  :  p  G  Z}.  These  sets 

can  be  used  to  define  the  union  \Jqei  +  Aq. 

(b)  This  time  we  define,  for  each  i  G  N,  the  set  B,  =  {p/2>\p  G  Z}.  These  sets 
may  be  used  to  define  the  intersection  fj(-epjB,-.  Q 

Exercise  8.3.  Write  the  sets  UT=i  L j>  3  +  1]  an(f  nj=i  [. j,j  +  1]  in  their  simplest  form, 
by  listing  the  first  few  sets  in  the  union  or  intersection  until  the  pattern  is  established, 
and  then  stating  your  guess.  (You  don’t  have  to  prove  that  your  guess  is  correct.)  Q 


Let  X  denote  our  universe  and  let  g/  be  a  collection  of  subsets  of  X.  Then  the 

union  of  the  collection  and  intersection  of  the  collection  are  defined  just  as  you 
might  expect:  The  union  is  defined  by 

P  A  =  {x  G  X  :  x  G  A  for  some  A  G  gfj 

AesX 

and,  for  a  nonempty  collection  g/,  the  intersection  by 

P|  A  =  {x  GX  :  x  G  A  for  all  A  G  g/j . 

In  some  situations,  it  will  be  helpful  to  label  our  sets.  We  often  label  objects  in 
real  life;  for  example,  we  label  runners  in  order  of  their  finish  in  a  race.  Why  not  do 
this  with  sets?  We  could  label  the  set  we  think  of  first  as  A\,  the  runner-up  would 
be  set  A2,  and  so  on.  Sometimes  this  is  possible,  as  in  Example  8.2  above,  but  other 
times  it  is  not.  For  example,  sometimes  we  do  not  know  how  many  sets  we  have  and 
other  times  we  will  have  so  many  sets  that  it  is  impossible  to  tell  which  “came  first.” 
While  this  may  seem  odd,  it  happens  all  the  time. 

So  again  let  X  be  our  universe  and  suppose  we  have  a  set  I.  Suppose  further  that 
for  each  a  G  /  there  is  a  unique  setAa  C  X  corresponding  to  it.  The  set  I  is  called  an 
index  set,  each  a  G  /  is  called  an  index,  and  the  set  {Aa  :  a  G 1}  is  called  an  indexed 
set  or  an  indexed  collection.  Thus,  an  index  set  is  a  set  that  labels  the  members  of 
another  set  g/.  We  may  also  write  {Aa  }0.c-i  for  an  indexed  collection  of  sets. 

In  this  setting,  our  definition  of  union  and  intersection  will  look  a  bit  different 
from  the  one  presented  above.  So  recall  the  notation:  X  denotes  our  universe  and  we 
let  {A a  :  cc  G  /}  be  an  indexed  collection  of  sets  with  AnCX  for  all  a  in  an  index 
set  I.  Then  the  union  of  the  indexed  collection  {Aa  :  a  G  1}  is  just 
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[J  Aa  =  {x  £  X  :x£Aa  for  some  a  £  /}, 
ael 

and  for  /  /  0,  the  intersection  of  the  indexed  collection  { A  a  :  a  £  1}  is 

P|  Aa  =  {x  £  X  :  x  £  Aa  for  all  a  £  1} . 
ael 

Exercise  8.4.  Find  the  simplest  way  to  describe  the  following  sets  (you  may  find 
sketches  helpful): 

(a)  UjteM+(0>*); 

(b)  U«6n[M; 

(c)  n„6N[0,n].  o 

Note  that  the  index  notation  and  the  general  definition  of  union  and  intersection 
given  here  include  the  cases  in  Chapter  6  and  the  ones  we  mentioned  in  the  begin¬ 
ning  of  this  chapter.  For  instance,  if  /  =  {1,2},  then  f} ,-e/A;  =A \  IIA2. 

Some  more  practice  with  this  notation  will  probably  be  very  helpful  at  this  point. 

Exercise  8.5.  (a)  Write  (J7=o[P:  j]  using  an  appropriate  index  set. 

(b)  Write  (J7=i  (0 ,j)  using  an  appropriate  index  set.  O 

Some  sets  are  more  easily  described  with  index  notation,  others  without  such 
notation.  Let’s  go  back  and  look  at  the  sets  in  Example  8.2. 

Example  8.6.  Consider  the  indexed  collection  of  sets  {Aq}q 6Z+  defined  in  Exam¬ 
ple  8.2  (a).  Then  \JqeIl+  Aq  =  Q. 

Proof.  If  x  £  Uqez+Aq,  then  x  £  Aq  for  some  q  £  Z+ .  Thus,  there  exist  q  £  Z' f  and 
p  £  Z  such  that  x  =  p/q.  Hence  x  £  Q,  and  we  have  shown  that  U(/ez  +Aq  C  Q. 

Conversely,  if  x  £  Q,  then  x  =  p/q  for  some  p,q  £  Z  with  q  0.  Now  (for 
reasons  that  you  will  explain)  we  may  choose  q  so  that  q  >  0.  For  this  q  we  have 
q  £  Z+  and  therefore  x£Aq.  Hence  x  £  U9gz+  Ay  So,  Q  C  Uf/ez+  Aq,  and  therefore 

U,eZ+^  =  Q-  ^  □ 

Example  8.7.  Consider  the  indexed  collection  of  sets  defined  in  Exam¬ 

ple  8.2  (b).  We  claim  that  f)ieNBj  =  Z. 

Proof.  If  x  £  n/eK  If.  then  x  £  B,  for  all  i  £  N.  In  particular,  x  £  Bq.  Therefore 
x  =  p/ 3°  =  p  for  some  p  £  Z.  So  x  £  Z,  and  thus  H/gN Bj  Q  Z. 

Now  let  x  £  Z.  For  each  i  £  N,  we  may  write  x  =  ( 3!x)/3 '.  Of  course,  3'x  £  Z, 
since  x  £  Z.  Hence  x  £  B,  for  all  i  £  N.  This  shows  that  x  £  D/gN-®*  al'b  therefore 

zcna«i. 

Combining  the  two  arguments  we  obtain  the  desired  equality,  =  Z.  □ 
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Exercise  8.8.  What’s  the  difference  between  “an  infinite  union  of  sets”  and  “a  union 
of  infinite  sets”?  Give  an  example  of  each,  showing  how  these  two  phrases  differ. 
(While  we  haven’t  given  a  rigorous  definition  of  infinite  here,  your  intuition  should 
suffice  to  solve  this  problem.)  O 

You  already  know  that  one  of  DeMorgan’s  laws  for  two  sets  can  be  stated  as 

X\(AUB)  =  (X  \A)  ("I  (X\B). 

This  can  be  rephrased  in  words  as  “the  complement  of  a  union  is  the  intersection  of 
the  complements.”  DeMorgan’s  laws  do  not  depend  on  the  number  of  sets  that  we 
have,  and  that  is  the  point  of  the  next  exercise. 

Exercise  8.9.  Show  that  the  general  DeMorgan’s  laws  hold:  For  every  universe  X, 
nonempty  index  set  I,  and  indexed  collection  of  sets  {Aa  :  a  £  /},  we  have 

(0  X  \  UaG/^a  =  riae/C^ \^a)  anc* 

(ii)  X  \  —  Uae/  (X\4).  o 

Exercise  8.10.  Suppose  Aa  C  B  for  each  a  £  I.  Show  that  (J aeiAa  Cfi.  O 


Definitions 

Definition  8.1.  Let  X  denote  the  universe  and  ,&/  a  collection  of  subsets  of  X.  Then 

the  union  of  the  collection  is  defined  as 

P  A  =  {x  £  X  :  x  £  A  for  some  A  £  g/} . 

Definition  8.2.  Let  X  denote  the  universe  and  .e/  a  nonempty  collection  of  subsets 
of  X.  Then  the  intersection  of  the  collection  is  defined  as 

P|  A  =  {x  £X  \  x  £  A  for  all  A  £  g/J . 

Aesrf 

Definition  8.3.  A  set  I  is  called  an  index  set  for  a  set  gf,  if  for  each  a  £  I  there  is 
an  element  Aa  £  ■&/  corresponding  to  a,  and  every  element  in  g/  is  labeled  in  this 
way. 

Definition  8.4.  Given  an  index  set  I  and  corresponding  sets  Aa  that  are  defined  for 
each  a  £l,  the  set  {Aa  :  a  £  1}  is  said  to  be  an  indexed  set  or  indexed  collection 
of  sets. 

Definition  8.5  (for  Problem  8.18).  A  collection  of  sets  g/  is  said  to  be  pairwise 

disjoint  if  the  following  is  satisfied:  For  all  X,Y  £  g/,  if  X  C\Y  ^  0,  then  X —  Y . 
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Solutions  to  Exercises 

Solution  (8.3).  You  can  see  that  UJ=i  \j,j  +  1]  =  [1,2]  U  [2,3]  U  [3,4]  U  •  •  •  =  [1,°°) 
and  n;=i  \j,  j +  1]  =  [1 , 2]  n  [2, 3]  n  [3,4]  n  •  •  •  =  0. 

Solution  (8.4).  You  can  check  the  following: 

(a)  IUm+(0,x)  =  (0,°°); 

(b)  U„eN[0,»]  =  [0,°°); 

(c)  a6N[0,«]  =  {0}. 

Solution  (8.5).  You  can  check  the  following: 

(a)  U7=o[Oj=U,eN[Oj; 

(b)  U;=1(0,7)  =  U;6z+(0,7)- 

Solution  (8.8).  An  infinite  union  of  sets  would  mean  that  we  take  the  union  over 
infinitely  many  sets  (but  the  sets  themselves  may  be  finite);  in  other  words,  the  index 
set  is  infinite.  For  example,  UaeN't”}  would  be  an  infinite  union  of  (finite)  sets.  On 
the  other  hand,  a  union  of  infinite  sets  would  mean  that  the  sets  themselves  must 
be  infinite  (while  the  index  set  may  be  finite).  For  example,  the  union  of  the  even 
integers,  2Z,  with  the  odd  integers,  2Z  + 1,  would  be  a  union  of  two  infinite  sets. 

Solution  (8.9).  We  will  show  part  (i)  and  will  leave  part  (ii)  for  you  to  do.  So  we 
need  to  show  that 

x\(Ua«)= n(*\A«)- 

ael  ael 

If  x  £  X  \  (UaG/^a).  then  x  £  X  and  x  ^  U aei^a-  By  the  definition  of  union  this 
means  that  x  £  X  and  x  (fi  Aa  for  every  a  £  I.  Hence,  x  £  X  \Aa  for  all  a  £  I,  and 
therefore  x  £  naeI(X\Aa).  Thus,  X  \  (UaG/^a)  £  Dae/CA^a)- 

Now  if  x  £  pi  ctei(X  \  Aa),  then  x  £  X  \Aa  for  all  a  £  I.  This  implies  that  x  £X 
and  x  ^  A a  for  every  a  £  I.  Hence,  we  have  x  £  X  and  x  0  \JaclAa.  It  follows  that 
x  £X\  (Uc (eIAa)  and  thus  f\aeI(X \Aa)  CX\ (U«G/^a)- 

The  two  subset  relations  give  the  desired  equality  between  the  sets. 

Solution  (8.10).  If  x  £  U aeiA-a,  then  there  exists  ao  such  that  x  £  A :k) .  Since  we 
suppose  that  A ^  C  B,  we  know  that  x  £  B.  Thus  U aeiAa  C  B. 


Problems 

Problem  8.1.  For  positive  integers  n,  consider  the  intervals  of  real  numbers  given 
by  An  =  [0,  1/h),  Bn  =  [0,1/n],  and  Cn  =  (0, 1/n). 

(a)  FindU,T=iAn,  U:=iB,„and  U^tQ. 

(b)  Findn“=iA„,  n~=]  B„,  and  fln=t 

(c)  Does  UneN^n  make  sense?  Why  or  why  not? 
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Problem  8.2.  If  Ax  =  [—x,x],  find  Ujces+  and  flveR+^t- 
Problem  8.3.  Find  simpler  notation  for  the  two  sets 

A  =  and  fi=n  (R\UJ+1))- 

j= o  je  z 

Problem  8.4.  Let  7  be  a  nonempty  index  set  and  suppose  that  B  C  Aa  for  all  a  £  I. 
Show  that  B  —  fl  aeIAcc- 

Problem  8.5.  Let  7^0  and  {Aa  :  a  £  7}  be  an  indexed  collection.  Prove  that  for  all 

P  £  I 

P  A  p  (=  P  Aa. 
ael  ael 

Problem  8.6.  Prove  or  give  a  counterexample:  Let  {An  :  n  £  Z+}  and  {B„  :  n  £  Z+} 
be  two  indexed  collections  of  sets.  If  An  C  Bn  for  all  n  £  Z+,  then 

D  An  c  f|B»- 

71—  1  71=1 


(Recall  that  A  C  B  means  strict  inclusion;  that  is,  A  C  B  and  A  ^  B.) 

Problem  8.7.  Let  7  be  a  set  and  let  {Aa  :  a  £  7},  and  {Ba  :  a  £  7}  be  two  indexed 
collections  of  sets  such  that  Aa  C  Ba  for  all  a  £  I.  Prove  that 

P  Aa  C  p  Ba. 

ael  ael 

Problem  8.8.  Prove  the  following  set  inclusion. 

U  {(tj)el2:r+pii}C  Pl  {(rj)et2:r+y>i}. 

be  R+  s€R“ 

Problem  8.9.  For  n  £  Z+  define 

1  3 

A,  =  {i£l:  -  <x<  2}  and  B„  =  {x  £  M. :  0  <  x  <  -  +  2} . 
n  n 

Prove  that  p  A„C  n®- 

nez+  nelA 

Problem  8.10.  Let  7  and  J  be  nonempty  sets  such  that  /  C  7,  and  let  \  Aa  :  a  £  7} 
be  an  indexed  collection.  Prove  that 

(a)  V^aejAa  Q  U aei^a  and 

(b)  fl ael-Aa  Q  fl aejAa- 

Problem  8.11.  Let  {A,-  :rgR}  and  {B,  :  r  £  M}  be  two  indexed  collections  of  sets. 
Prove  that 
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fK  U  fH  <=  pl^rUfir). 

\ri  i  /  \r(.  i  )  reR 

Provide  an  example  showing  that  this  inclusion  can  be  proper. 

Problem*  8.12.  Let  I  be  a  nonempty  set,  {Aa  :  a  £  1}  an  indexed  collection  of  sets, 
and  let  B  be  a  set. 

(a)  Prove  the  distributive  property: 


u  Aa  nfi=  U  (AanB). 

\ael  J  ael 

(b)  State  and  prove  a  distributive  property  for  (flae/^a)  U B. 

Problem*  8.13.  Suppose  that  {Aa  :  a  £  7}  is  a  nonempty  indexed  collection  of  sub¬ 
sets  of  a  set  X. 

(a)  If  Aag  =  0  for  some  Oo  €  /,  prove  that  f| aeiAa  =  0. 

(b)  If  A  aQ=X  for  some  ocq  £  /,  prove  that  U  aei-^a  =  %■ 


Problem  8.14.  Define 

A  =  R\  P|  (M\ {— n,  —  n+ 1, •  •  •  ,0, . . . ,n—  l,n}) . 

neiA 

The  set  A  should  be  familiar  to  you.  Guess  what  it  is  and  then  prove  that  your  guess 
is  correct. 

Problem  8.15.  Guess  a  simpler  way  to  express  the  set  A  defined  as 

A  =  Q\p(K\{2n}), 

nZjj 

and  then  prove  that  your  guess  is  correct. 

Problem  8.16.  Suppose  that  X  is  a  set  with  more  than  one  element.  What  is 
U.vgxM?  What  is  fWM? 

Problem  8.17.  For  every  aeR  and  for  every  m.n  £  Z+  define 

Aa  =  {x  £  Z+ :  x  <  a}  and  Bmn  =  {x  £  M  :  \x  —  m\  <  -}. 

n 


u 

aeR  mGZ+ 


Prove  that 
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Problem*  8.18.  A  collection  of  sets  sY  is  said  to  be  pairwise  disjoint  if  the  follow¬ 
ing  is  satisfied:  For  all  X,Y  £  sY ,  if  Iflf  ^  0,  then  A  =  F. 

A  comment  about  this  definition  may  be  in  order:  Speaking  informally,  a  collec¬ 
tion  of  sets  is  pairwise  disjoint  if  whenever  we  choose  two  sets  from  the  collection, 
they  are  disjoint  or  they  are  equal. 

(a)  Give  an  example  of  a  pairwise  disjoint  collection  of  infinitely  many  sets. 

(b)  What  is  the  contrapositive  of  “if  X  D  Y  ^  0,  then  X  =  F”? 

(c)  What  is  the  converse  of  “if  X  D  F  ^  0,  then  X  =  F”? 

(d)  If .'//  is  a  pairwise  disjoint  collection  of  sets,  does  the  assertion  you  found  in 
(b)  hold  for  all  X,  Y  £  sYl 

(e)  If  the  assertion  that  you  found  in  (b)  holds  for  all  X  and  F  in  some  set  sY ,  is 
sY  a  pairwise  disjoint  collection  of  sets? 

(f)  Suppose  that  SS  is  a  pairwise  disjoint  collection  of  sets.  Can  we  conclude  that 

C\xe@x  =  ®- 

(g)  Suppose  that  =  0.  Is  38  necessarily  a  pairwise  disjoint  collection  of 

sets? 

Problem  8.19.  For  n  £  N  we  define  A„  =  {x  £  N  :  |  —  1  <  x  <  "  }.  We  use  these  sets 
for  two  collections  of  sets:  sY  =  {An  :  n  £  N}  and  S3  =  {An  :  n  £  2N}  (where  2N 
denotes  the  set  of  even  nonnegative  integers). 

(a)  Is  the  collection  sY  pairwise  disjoint?  Prove  or  give  a  counterexample. 

(b)  Is  the  collection  38  pairwise  disjoint?  Prove  or  give  a  counterexample. 

(c)  Is  sY  contained  in  381  Are  the  sets  equal? 

Problem  8.20.  An  indexed  collection  of  sets  {An  :  n  £  Z+}  is  said  to  be  increasing 
if  An  C  An+ 1  for  each  n  £  Z+.  The  indexed  collection  is  said  to  be  strictly  increasing 
if  A„  C  An+ 1  for  each  n  £  Z+. 

(a)  Give  an  example  of  an  increasing  indexed  collection  of  sets  (consisting  of 
subsets  of  R)  that  is  not  strictly  increasing. 

(b)  Find  an  increasing  indexed  collection  of  sets  {A„  :  n  £  Z+}  (consisting  of 
subsets  of  R)  such  that  UneZ+An  =  [0, 1]  and  nneZ+A„  =  {0}. 

(c)  Find  a  strictly  increasing  indexed  collection  of  sets  {B„  :  n  £  Z+}  (consisting 
of  subsets  of  R)  such  that  UneZ+B„  =  [0, 1]. 

Problem  8.21.  Find  an  example  of  an  indexed  collection  of  sets  {A  j  :  j  £  Z+}  such 
that  Aj+ 1  C  Aj  for  each  j  £  Z+,  and  f)7=i  ^  ®- 


Chapter  9 

The  Power  Set  and  the  Cartesian  Product 


Now  that  we  know  about  sets,  we  can  construct  some  new  ones  from  old  ones  in 
even  more  ways  than  we  did  before.  In  this  section  we  look  closely  at  two  special 
sets:  the  first  is  called  the  power  set,  and  the  second  is  called  the  Cartesian  product 
of  two  sets. 

Let  S  be  a  set.  Then  the  power  set  of  S  is  the  set  of  all  subsets  of  S.  We  shall 
denote  the  power  set  by  &(S).  When  working  with  the  power  set,  the  following 
observation  will  be  extremely  useful:  A  £  &(S)  if  and  only  if  ,4  C  .S'. 

Before  we  begin,  note  that  the  power  set  is  again  a  set  and  its  elements  are  also 
sets.  The  power  set  is  never  empty.  Why? 

Example  9.1.  Consider  the  set  S  =  {0, 1}.  We  look  for  all  subsets  of  S.  They  are 
0,  {0},  { 1 },  and  {0,1}.  Then  the  power  set  of  S  is  &($)  =  {0,  {0},{1},{0, 1}}.  O 


As  you  probably  noticed,  the  notation  is  tricky  here.  You  have  to  distinguish  very 
carefully  between  “element  of”  (£)  and  “subset  of”  (C).  Until  you  get  really  good  at 
this  distinction,  we  suggest  that  you  ask  yourself  each  time  you  use  £  or  C  whether 
you  chose  the  correct  symbol.  It’s  time  for  some  practice  with  this  notation. 

Exercise  9.2.  Let  A  be  a  set.  Which  of  the  following  are  true,  which  are  false?  Ex¬ 
plain. 

(a)  A  £  £?(A); 

(b)  0C^(A); 

(c)  0=^(0); 

(d)  {0}  =  ^(0); 

(e)  if  a  £  A,  then  {a}  C  ^(A). 

Now  we  are  ready  to  explore  the  power  set  construction  further. 

Exercise  9.3.  Let  A  =  {1,2,3},  B  =  {2,5},  C  =  {0,1}. 

(a)  Find  £P(B)  and  ,^(C).  Do  these  two  sets  have  elements  in  common? 

(b)  Find  ^(A),  £?( B ),  £P(AnB),  and  ^(AUfi). 
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(c)  Compute  2?(A)  U  £P{B)  and  £P(A)  n  £P(B).  Q 

Remember  to  use  element  notation  when  you  are  thinking  of  the  set  as  an  element 
and  subset  notation  when  you  are  showing  containment  of  sets. 

In  the  next  exercise,  we  will  ask  you  to  prove  that  two  sets  are  equal.  We’ve  done 
this  many  times  in  the  previous  chapters,  and  so  you  know  one  way  to  begin:  use  an 
element-chasing  argument.  Ask  yourself  if  your  set  plays  the  role  of  a  set  or  the  role 
of  an  element,  and  use  the  corresponding  notation. 

Exercise  9.4.  Let  A  and  B  be  sets.  Prove  that  AT1  (A  P  B)  =  & (A )  n  8P(B).  O 

When  we  talk  about  a  set,  it  is  understood  that  if  we  discuss  the  set  {1,3}  we 
are  discussing  the  set  {3, 1}  as  well.  A  set  is  determined  by  its  elements  and  there 
is  no  notion  of  order  associated  with  what  we  have  defined  so  far.  When  there  is  an 
order,  such  as  when  we  plot  points  and  need  to  know  which  is  the  x  coordinate  and 
which  is  the  y  coordinate,  we  use  the  notion  of  an  ordered  pair.  The  next  set  we  will 
consider  is  called  the  Cartesian  product  of  two  sets  X  and  Y,  and  it  is  constructed 
using  ordered  pairs. 

Here  is  our  informal  definition:  An  ordered  pair  (x,y)  is  a  pair  of  objects  in 
which  there  is  a  first  object  x  and  a  second  object  y.  The  very  important  property  of 
ordered  pairs  is  that  (x,y)  =  (z,  w)  if  and  only  if  x  =  z  and  v  =  w. 

We  may  now  define  the  Cartesian  product  of  X  and  Y ,  denoted  X  x  Y,  to  be  the 
set  of  all  ordered  pairs  in  which  the  first  element  comes  from  X  and  the  second  from 
Y ;  that  is. 


XxY  =  {{x,y):x£X,yeY}. 

For  example,  if  X  =  [0,1]  and  Y  =  [0,2],  then 

XxY  =  {{x1y)  :0<x<  1,0  <y  <2}. 

This  is  the  rectangle  in  K2  with  base  along  the  interval  [0, 1]  and  height  along  the 
interval  [0,2]  sketched  in  Figure  9.1. 

Perhaps  you  are  wondering  why  we  said  “informal  definition”  when  we  presented 
our  definition  of  ordered  pair.  Since  this  is  probably  the  definition  you  were  expect¬ 
ing,  it  most  likely  looks  formal.  It  turns  out  that  there  is  a  rigorous  definition  of 
ordered  pair;  one  that  can  be  presented  without  referring  to  the  “first”  and  “second” 
coordinates.  The  reason  we  do  not  present  it  here  is  that,  in  our  opinion,  a  rigorous 
definition  is  mostly  confusing  rather  than  helpful  at  this  point.  If  you  have  a  strong 
desire  to  know  more  about  this,  you  can  work  Problem  9.23  in  this  chapter. 

It’s  time  for  a  few  more  examples  of  Cartesian  products. 

Exercise  9.5.  (a)  Write  out  all  the  elements  in  {0, 1}  x  {2,3}  and  {2,3}  x  {0, 1}. 

(b)  Sketch  the  Cartesian  products  [0, 1]  x  [2,3]  and  [2,3]  x  [0, 1]  as  sets  of  points 
in  the  plane. 
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Fig.  9.1  [0,1]  x  [0,2] 


y  A 

2  — 


1 


1 


2 


(c)  Recall  that  we  defined  R2  =  {(x,y)  :  x  £  R,y  £  R}.  Write  R2  using  the  Carte¬ 
sian  product  notation. 

(d)  Having  done  that,  can  you  describe  R3  as  a  Cartesian  product  of  two  sets? 
(You  might  have  more  than  one  description  that  seems  reasonable  to  you.) 

(e)  The  set  of  even  integers  is  denoted  by  2Z  (see  Example  6.1).  Make  sketches 

that  describe  the  sets  Z  x  Z,  Z  x  2Z,  and  2Z  x  Z.  Q 


When  we  prove  that  two  sets  defined  using  Cartesian  products  are  equal,  we  can 
still  use  the  method  of  “element-chasing.”  Remember  that  we  need  to  use  the  special 
form  of  the  element,  namely,  that  it  looks  like  an  ordered  pair.  If  you  don’t  use 
the  form  of  an  element,  you  may  lose  valuable  information  and,  as  a  consequence, 
proving  your  result  will  be  tougher  than  it  has  to  be.  As  you  read  the  proofs  below, 
pay  attention  to  where  we  use  the  fact  that  the  element  we  consider  is  an  ordered 
pair. 

Theorem  9.6.  Let  A  be  a  set.  Then  A  x  0  =  0. 

Proof.  Suppose,  to  the  contrary,  that  Ax0^0.  Then  there  exists  an  element  (x,y)  £ 
A  x  0.  Therefore,  by  our  definition  of  Cartesian  product,  x  £  A  and  y  £  0.  But  this 
contradicts  the  fact  that  0  is  the  empty  set.  Thus  A  x  0  =  0.  □ 

Theorem  9.7.  Let  A,B,C,  and  D  be  sets.  Then 

(A  xB)U(CxD)  C  (AUC)  x  (BUD). 

Proof.  If  z  £  (A  x  B)  U  (C  x  D),  then  z  =  (x,y)  where  (x,y)  £AxB  or  (x,y)  £C  xD. 
Suppose  first  that  (x,y)  £AxB.  Then  x  £  A  and  y  £  B.  In  this  case  x  £  A  UC  and  y  £ 
BUD,  so  by  definition  (x,y)  £  (AUC)  x  (BUD).  Now  suppose  that  (x,y)  £  C  x  D. 
Thenx  £  C  and  y  £  D.  Therefore  x  £  AUC  and  y  £  BUD.  So  (x.y)  £  (AUC)  x  (BU 
D) .  Hence  (Ax5)U(CxD)  C  (AUC)  x  (BUD),  as  desired.  □ 

Again,  notice  how  quickly  we  changed  from  z  to  (x,  y)  in  the  proof.  That’s  be¬ 
cause  we  can’t  do  anything  if  we  don’t  realize  that  z  is  really  an  ordered  pair. 

Now  consider  the  following  nontheorem. 
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Nontheorem.  Let  A .  B ,C .  and  D  be  sets.  Then 

(AUC)  x  (BUD)  C  (AxB)U(CxD). 


Not  a  proof.  If  (x,y)  £  (AUC)  x  (BUD),  thenx  €  AUC  andy  £  DUD.  Thusx  £  A  or 
x  £  C,  and  y  £  B  or  y  £  D.  Hence  x  £  A  and  y  £  B  or  x  £  C  and  y  £  Z).  So  (x,y)  £  A  x  D 
or  (x,y)  £  C  x  Z).  Thus  (x,y)  £  (A  x  D)  U  (C  x  D).  H 

Exercise  9.8.  Find  the  error  in  the  nonproof  above  and  show  that  Nontheorem  9 
really  is  not  a  theorem  because  the  statement  is  false.  (Find  sets  for  which  the  state¬ 
ment  does  not  hold.)  O 

In  these  problems  and  all  that  follow,  you  will  begin  with  an  element  in  your 
set.  It  will  be  helpful  to  you  to  think  about  the  form  of  your  element.  Is  it  a  set? 
an  ordered  pair?  If  you  rush  through  these  proofs,  as  we  did  in  Nontheorem  9,  you 
will  prove  things  that  are  false.  This  is  generally  frowned  upon  in  mathematics.  Go 
slowly,  be  careful,  and  check  each  step. 

We  will  now  define  relations.  We  will  soon  see  that  there  is  a  connection  between 
functions  (something  you  probably  feel  familiar  with)  and  relations  (something  you 
may  not  feel  terribly  familiar  with).  We  begin  with  a  definition. 

Suppose  that  X  and  Y  are  two  sets.  A  relation  from  X  to  Y  is  a  subset  of  X  x  Y. 
A  relation  from  A  to  A  is  called  a  relation  on  A. 

Exercise  9.9.  For  the  following,  decide  whether  or  not  they  are  relations  from  a 
set  A  to  a  set  Y.  If  they  are,  say  what  A  is  and  what  Y  is.  Then  describe  each  set 
pictorially  (as  a  set  of  points  in  the  plane)  or  in  words: 

(a)  {(x,y)  £  M2  :  x  <  y}; 

(b)  {x/y  :x,y  £  Z  andy  ^  0}; 

(c)  j(x,y)  £  R2  :  x,y  £  Z  and  x  +  y  =  0}.  O 

We  will  learn  more  about  relations  in  Chapter  10. 


Definitions 

Definition  9.1.  The  power  set  of  a  set  S  is  the  set  of  all  subsets  of  S.  It  is  denoted 

by  &>($). 

Definition  9.2  (see  Problem  9.23  for  a  formal  definition).  The  ordered  pair  of 

the  objects  x  and  y  is  the  object  (x,y)  in  which  x  is  considered  to  be  the  first  object 
and  y  the  second  object. 


9  The  Power  Set  and  the  Cartesian  Product 


93 


Definition  9.3.  The  Cartesian  product  of  sets  X  and  Y  is  the  set  of  ordered  pairs 

XxY  =  {(x,y)  :x£X,y£Y}. 

Definition  9.4.  A  relation  from  a  set  A  to  a  set  Y  is  a  subset  of  X  x  Y.  If  Y  =  X, 

we  say  the  relation  is  a  relation  on  X. 


Solutions  to  Exercises 

Solution  (9.2). 

(a)  True.  We  have  A  C  A  and  thus  A  £  £?(A). 

(b)  True  (e.g.,  by  Theorem  6. 1 1). 

(c)  False.  Since  0  C  0  we  get  0  £  3P(0)  and  thus  ^(0)  7^  0. 

(d)  True.  See  the  argument  for  (c)  and  note  that  if  B  C  0,  then  B  =  0. 

(e)  False.  Here  is  a  counterexample:  Let  A  =  {0},  then  ^P(A)  =  {0,  {0} }  and 
0  G  A  but  {0}  2  ^(A). 

Solution  (9.3). 

(a)  &>(B)  =  {0, {2}, {5}, {2,5}},  &>(C)  =  {0, {0}, {1}, {0, 1}},  and  the  empty 
set  is  an  element  of  both  sets. 

(b)  You  can  check  that  &(A)  =  {0,  {1},  {2},  {3},  {1,2},  {1,3},  {2, 3},  {1,2, 3}} 
and  ^(AdB)  =  {0,{2}}.  We  leave  ^(AUB)  to  you. 

(c)  ^(A)  (T  ^(B)  =  {0,  {2}}.  We  leave  3?(A)  U  3?(B)  to  you. 

Solution  (9.4).  If  x  £  .3^ (.4  P  B),  then  x  C  A  P  B.  Thus  x  C  A  and  x  C  B.  'I’his  implies 
that  x  £  £?(A)  and  x  £  t?(B),  so  x  £  &(A)  (T  £P(B).  Since  x  was  an  arbitrarily 
chosen  element,  ^(ADB)  C  &(A)  fl  &(B).  Each  of  these  steps  is  reversible,  so 
the  containment  ^(A)  fl  £?(B)  C  ^(AnB)  follows  as  well. 
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Fig.  9.2  [0, 1]  x  [2,3]  and  [2,3]  x  [0, 1] 
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Fig.  9.3  Z  x  Z,  2  x  2Z,  and  2Z  x  Z 


Solution  (9.5). 

(a)  The  two  sets  are 

{0, 1}  x  {2,3}  =  {(0,2),  (0,3),  (1,2),  (1,3)} 
and 

{2,3}  x  {0, 1}  =  {(2,0), (3,0), (2, 1), (3, 1)}. 

(b)  The  two  sets  are  sketched  in  Figure  9.2. 

(c)  K2  =  RxR. 

(d)  One  answer  might  be  R2  x  M. 

(e)  These  are  sketched  in  Figure  9.3. 

Solution  (9.8).  Our  “Not  a  proof”  claims  that  “xGA  or  x  €  C,  and  y  €  B  or  y  G  D. 
Hence  x  G  A  and  y  G  B  or  x  G  C  and  y  G  DT  This  conclusion  is  not  justified:  it  could 
also  be  that  x  G  A  and  y  G  D,  or  x  G  C  and  y  G  B. 

As  an  example,  let  A  =  D  =  0  and  B  =  C  =  R.  Then  (AUC)x(BUfl)  =  RxR, 
while  (AxB)U(C  xD)  =  0. 

Solution  (9.9). 

(a)  This  is  a  relation  from  X  =  R  to  Y  =  R,  consisting  of  the  set  of  points  in  R2 
lying  above  or  on  the  line  y  =  x. 
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(b)  This  is  not  a  subset  of  X  x  Y  for  any  choice  of  X  and  Y,  hence  this  is  not  a 
relation. 

(c)  This  is  a  relation  from  X  =  Z  to  Y  =  Z,  and  consists  of  the  points  for  which  x 
is  an  integer  and  y  =  — x;  that  is,  this  is  the  set  {(x,  — x)  :  x  G  Z}. 


Problems 

Problem  9.1.  Let  S  =  {a,b,c}.  Find  &(S). 

Problem  9.2.  Replace  ?  by  the  proper  symbol,  choosing  from  among  the  following: 

S,  C,  or  C. 

(a)  N?Z; 

(b)  {1,2,3}  7  Z; 

(c)  {5}?Z; 

(d)  Z+  ?  ^(Z); 

(e)  {1,2,3}?  £*({1,2,3}); 

(f)  5  ?  Z; 

(g) 

(h)  0?^(Z). 

Problem  9.3.  Give  an  explicit  description  of  £P(^({1}))  by  listing  all  its  elements. 

Problem  9.4.  Say  whether  the  following  are  true  or  false  and  give  a  reason  for  your 
answer. 

(a)  {0}  C  A  for  all  sets  A. 

(b)  0  C  ^(A)  for  all  sets  A. 

(c)  If  A  =  {x,y},  then  &>(A)  =  {{x},  {>’},  {x,y},  {0}}. 

(d)  If  A  =  {x,y},  ^({A})  =  {{x},{y},{x,y},0}. 

(e)  If  Ao  =  {0},  then  Ao  S  ^(A)  for  all  sets  A. 

Problem  9.5.  (a)  Show  that  £^(A)  U  C  AAiA  L  B). 

(b)  Show  that  ^(A)  U &{B)  ^  £P(AUB)  in  general,  by  exhibiting  two  concrete 
sets,  A  and  B ,  for  which  the  aforementioned  inequality  holds. 

Problem  9.6.  (a)  Let  X  be  anonempty  set  in  &>(A\B).  MustX  €  ^32(A)\ A?(B)! 

(b)  Prove  that  it  is  never  the  case  that  £?(A\B)  =  £?(A)  \  &(B). 

Problem  9.7.  Let  2Z  denote  the  even  integers  and  2Z  +  1  denote  the  odd  integers. 
What  is  ^(2Z)(T^(2Z+1)? 

Problem  9.8.  Show  that  A  C  B  if  and  only  if  YX (A )  C  fXiB). 

Problem  9.9.  For  every  set  I  and  for  every  indexed  collection  of  sets  {Aa  :«£/(, 
prove  that 
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Problem  9.10.  Let  {Aa  :  cc  £  7}  be  a  nonempty  indexed  collection  of  sets.  Prove 

that  |  P|  A„  J  =  P|  ^{Aa). 

\aei  J  ael 

Problem  9.11.  How  many  elements  are  there  in  the  power  set  of  {1,2, 3,4}?  How 
many  elements  are  in  the  power  set  of  (1,2, 3, 4, 5}?  State  a  general  result.  You’ll 
be  able  to  prove  it  later. 

Problem  9.12.  Describe  the  following  relations  pictorially  (as  a  set  of  points  in  the 
plane)  or  in  words: 


(a) 

{( x,y )  g 

N  x  Z  :  x  >  y}; 

(b) 

{(■ry)  G 

R2  :  x  =  y}; 

(c) 

{(■lt)  G 

Z  x  Z  :  x+y  G  2Z}; 

(d) 

{0,1}  x 

N; 

(e) 

{(xo-2)  : 

x  G  R}; 

(f) 

{(Vx,x) 

:xG  Z+}. 

Problem  9.13.  Describe  the  following  Cartesian  products: 

(a)  0  x  N; 

(b)  Z  x  0; 

(c)  RxK; 

(d)  IxZ. 

Problem  9.14.  Which  of  the  following  sets  can  be  written  as  the  Cartesian  product 
of  two  subsets  of  M?  (Either  give  the  two  sets  or  explain  why  two  such  sets  do  not 
exist.) 

(a)  {(x,y):0<y<5}; 

(b)  {(x,y)\x>y}\ 

(c)  {{x,y)  :  x2+y2  =  1}. 

Problem  9.15.  Show  that  N  x  N  C  Z  x  Z. 

Problem  9.16.  (a)  In  the  plane,  sketch  the  set  [0, 1]  x  ([1,3]  U  [2,4]). 

(b)  Sketch  ([0, 1]  U  [1,4])  x  ([0, 1]  U  [2,4]). 

Problem  9.17.  Let  A,  B,  C,  and  D  be  nonempty  sets.  Then  A  x  B  =  C  x  D  if  and 
only  if  A  =  C  and  B  =  D. 

(a)  Prove  this  statement. 

(b)  One  of  the  two  implications  does  not  require  the  sets  to  be  nonempty.  Which 
one? 

(c)  If  we  do  not  require  the  sets  to  be  nonempty,  then  the  statement  is  false.  Give 
examples  of  sets  A,B,C,  and  I)  to  show  the  necessity  of  the  assumption  that 
the  sets  be  nonempty. 
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Problem  9.18.  Suppose  A,B,C,  and  D  are  four  sets.  If  Ax  B  CC  x  D,  must  ACC 
and  BCD ?  Why  or  why  not? 

Problem  9.19.  Let  A,B,  and  Cbe  sets.  If  the  statements  below  are  tme,  prove  them. 
If  they  are  false,  give  a  counterexample: 

(a)  A  x  (BUC)  =  (Ax  B)U  (Ax  C); 

(b)  A  x  (fine)  =  (A  x  B)  n  (A  x  C). 

Problem  9.20.  LetA=  {1 ,  { 1 },  {1 ,  { 1 }}}. 

(a)  Find  Ax  A. 

(b)  Find  A  n  £P(A). 

Problem  9.21.  Let  A  =  {0, 1 }.  We  define  a  relation  R  from  A  to  the  power  set  of  A 
by  R  =  {(x:,y)  cAx  £P(A)  :  x  £  >’}.  List  all  elements  of  R. 

Problem  9.22.  Let  {Aa  :  a  £  /}  and  {Bp  :  £  /}  be  two  indexed  collections  with 

nonempty  index  sets  I  and  J.  Prove  that 


Problem  9.23.  This  problem  introduces  rigorous  definitions  of  an  ordered  pair  and 
Cartesian  product.  Let  A  be  a  set  and  a,b  £  A.  We  define  the  ordered  pair  of  a  and  b 
with  first  coordinate  a  and  second  coordinate  b  as 


(a,b)  =  {{a},{a,b}}. 


Using  this  definition  prove  the  following. 

(a)  If  ( a,b )  =  (x,y),  then  a  =  x  and  b  =y. 

(b)  If  a  £  A  and  b  £  B,  then  (a,b)  £  ^(^(ALIfi)). 

Now  we  are  able  to  define  the  Cartesian  product  of  the  two  sets  A  and  B  as  the  set 
AxB  =  {x  £  ^(^(AUB))  :  x  =  (a,b)  for  some  a  £  A  and  some  b  £  B}. 


(c)  Using  the  definitions  introduced  in  this  problem,  prove  that  if  A  C  C  and 
BCD ,  then  Ax  B  CC  xD. 

This  is  a  pretty  complicated  definition.  It  is  also  not  our  idea,  but  rather  an  idea 
that  was  born  from  axioms.  P.  Halmos’s  book,  [41],  is  an  excellent  reference  for  this 
subject. 
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Tips  on  Writing  Mathematics 


This  letter  is  longer  than  usual  simply  because  1  could  not  spare  the  time  to  make  it 
shorter. — Blaise  Pascal,  [80,  p.  865]1 

After  this  point  in  the  course  the  work  will  change.  You’ll  find  that  you  are  writ¬ 
ing  more  in  words  than  in  symbols.  How  you  write  is  as  important  as  what  you 

write.  Here  are  some  things  to  think  about  as  you  write  your  proofs. 

•  In  mathematics,  it  is  always  important  that  the  reader  know  what  the  variables 
stand  for.  This  was  true  in  algebra  in  high  school,  geometry,  and  calculus,  and  it 
is  true  here  too.  If  you  use  symbols — any  symbols — make  sure  the  meaning  is 
clear  to  the  reader  before  you  use  them. 

•  Think  about  your  notation,  and  choose  notation  that  is  easy  on  the  reader. 

•  A  variable  should  only  be  assigned  one  meaning  in  your  proof.  For  example, 
if  you  used  C  to  denote  the  complex  numbers,  don’t  use  C  again  to  denote  a 
different  set. 

•  Try  for  a  good  blend  of  symbols  and  words.  Don’t  juxtapose  unrelated  symbols 
if  you  don’t  have  to.  For  example,  consider  the  sentence  “So  1  <  p,q>  2.”  You 
might  find  this  confusing  and  (unnecessarily)  difficult  to  read.  If  we  say  “So 
1  <  p  and  q  >  2,”  the  sentence  is  clear.  It’s  often  easier  to  read  things  if  you  put 
a  word,  even  a  little  one,  between  symbols. 

•  Avoid  starting  a  sentence  with  a  symbol.  This  often  confuses  the  reader  unneces¬ 
sarily.  For  example,  consider  the  following  sentence. 

Thus  x  C  A.  A  is  a  subset  of  B. 

First,  the  A.  A  just  doesn’t  look  nice.  Second,  it’s  hard  to  read. 

•  Every  sentence  should  start  with  a  capital  letter  and  end  with  a  period,  just  like 
sentences  are  supposed  to  begin  and  end. 

•  All  grammatical  rules  apply.  Make  sure  your  sentence  has  a  noun  and  a  verb,  for 
example. 

•  Strive  for  clarity.  Always  keep  the  reader  in  mind.  If  something  follows  from 
a  definition,  say  so.  The  reader  will  appreciate  this  and  will  know  what  you  are 
thinking  and,  what’s  more,  you  will  know  why  what  you  say  is  true.  If  something 
follows  from  Theorem  10.1,  say  so.  It  is  extremely  important  for  you  to  be  aware 
of  when  you  are  using  a  result.  For  one  thing,  it  means  that  you  are  more  likely 
to  notice  if  you  are  using  a  result  that  you  do  not  have.  (This  would  be  wrong. 
Don’t  do  it.)  For  another,  it  helps  the  reader  who  may  not  fully  understand  what 
you  are  doing. 

•  Certain  phrases  are  particularly  helpful  in  guiding  a  reader  through  your  proof. 
For  example,  “Suppose  to  the  contrary,  . . .”  tells  the  reader  that  your  proof  will 
be  done  by  contradiction.  As  a  second  example,  if  you  are  proving  “A  if  and  only 
if  B,”  your  reader  will  understand  everything  better  if  you  say,  “Suppose  A.  . . . 
Then  we  have  B.”  And  then  say,  “Suppose  B.  . . .  Then  we  have  A.”  You  should 


1  The  translation  is  ours. 
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alert  the  reader  to  a  proof  that  will  be  in  cases,  or  a  proof  that  will  proceed  using 
the  contrapositive.  You  should  not  only  tell  the  reader  how  you  will  begin  the 
proof,  you  should  also  tell  the  reader  when  you  believe  you  have  completed  the 
proof.  Words  like  “thus,  we  have  established  the  desired  result”  will  let  the  reader 
know  that  you  think  you  are  done  now  and  it’s  his  or  her  turn  to  understand  why. 
Other  examples  of  phrases  that  you  may  use  to  guide  your  reader  will  come  up 
as  we  learn  new  techniques. 

•  If  you  can  find  a  shorter,  clearer  solution,  do  so. 

•  Perhaps  the  most  difficult  thing  about  writing  a  proof  is  to  find  a  balance  between 
the  main  ideas  in  the  proof  and  the  details.  You’ll  often  find  that  the  more  you 
explain,  the  more  you  hide  the  main  ideas.  On  the  other  hand,  if  you  don’t  explain 
enough,  you  might  overlook  an  important  detail  or  confuse  your  reader.  It’s  not 
easy  to  strike  the  right  balance.  This  is  why  we  suggest  waiting  a  bit,  and  then 
rereading  your  proof.  If  you  can’t  figure  out  why  you  did  something,  it’s  unlikely 
that  someone  else  will. 

•  If  you  have  a  partner  in  the  class,  it  is  an  excellent  idea  to  exchange  papers  and 
see  if  things  are  clear  to  each  of  you.  (Check  with  your  teacher  to  make  sure  this 
is  allowed,  of  course.) 

Exercise  9.10.  Here’s  a  student’s  proof  of  the  following  theorem:  Let  x  and  y  be  real 

numbers.  Show  that  xy  <  x2 /2+y2  /2. 

Proof  ( Student  version ). 


(x  —  y)2  >  0 
x2  — 2 xy+y2  >  0 
x2  +y2  >  2 xy 

x2 /2  +  y2 /2  >  xy  □ 

Criticize  the  student’s  solution  and  rewrite  the  proof,  paying  close  attention  to  the 
tips  presented  here.  O 

Example  9.11  (Adapting  an  idea  of  [112]).  If  you  are  trying  to  publish  a  paper,  you 
should  think  carefully  about  your  exposition.  Strive  for  clarity.  Here’s  an  example 
of  how  to  improve  mathematical  writing: 

Theorem  (Steiner’s  Theorem  I).  Given  any  triangle,  there  is  a  unique  ellipse  in¬ 
scribed  in  the  triangle  that  passes  through  the  midpoints  of  the  sides  of  the  triangle 
and  is  tangent  to  the  sides  of  the  triangle  at  these  three  midpoints. 

Let’s  see.  We’ve  used  the  word  “triangle”  four  times.  Let’s  try  to  eliminate  that 
repetition. 

Theorem  (Steiner’s  Theorem  II).  Given  any  triangle  T,  there  is  a  unique  ellipse 
inscribed  in  T  that  passes  through  the  midpoints  of  the  sides  ofT  and  is  tangent  to 
the  sides  ofT  at  these  three  midpoints. 
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Now,  if  the  ellipse  is  tangent  to  the  sides  of  T,  then  it  must  pass  through  those  points 
as  well.  So,  it  looks  like  that’s  a  phrase  we  can  omit.  Let’s  try  it. 

Theorem  (Steiner’s  Theorem  III).  Given  a  triangle  T,  there  is  a  unique  ellipse 
inscribed  in  T  that  is  tangent  to  each  of  the  sides  ofT  at  the  three  midpoints. 

This  looks  simpler  than  our  previous  statement.  Is  there  anything  else  we  can  do 
to  shorten  this?  Well,  yes.  Yes  there  is.  The  “unique  ellipse  inscribed  in  T”  is  too 
wordy.  If  we  say  that  the  ellipse  is  tangent  to  each  of  the  sides  of  T  at  the  midpoints, 
then  T  must  be  the  circumscribing  triangle.  Now,  do  we  really  have  to  say  “three” 
midpoints?  One  argument  in  favor  of  “three”  is  that  the  reader  will  immediately  see 
that  we  are  talking  about  all  possible  midpoints.  But  we’ve  got  that  covered:  we  said 
“tangent  to  each  of  the  sides.”  So  how  about: 

Theorem  (Steiner’s  Theorem  IV).  Given  a  triangle  T,  there  is  a  unique  inscribed 
ellipse  that  is  tangent  to  each  of  the  sides  ofT  at  the  midpoints. 

Is  this  the  shortest  possible  version?  If  a  convex  figure  is  tangent  to  each  side  of  a 
triangle,  then  it  is  necessarily  “inscribed.”  Thus  we  can  drop  this  word,  as  it  does 
not  convey  additional  information. 

Theorem  (Steiner’s  Theorem  V).  Given  a  triangle  T,  there  is  a  unique  ellipse  that 
is  tangent  to  each  of  the  sides  ofT  at  the  midpoints. 

Is  the  last  statement  really  the  “best”  one?  It  certainly  is  the  shortest.  The  reader 
may  have  to  think  for  a  moment  before  realizing  that  the  ellipse  is  inscribed.  If  we 
state  this  explicitly  rather  than  implicitly,  it  will  most  likely  aid  the  reader.  Most 
statements  that  we  found  of  this  theorem  include  the  word  “inscribed,”  perhaps  for 
this  reason.  Shortest  is  not  necessarily  equivalent  to  clearest!  In  this  case,  we  vote 
for  version  IV  as  the  clearest. 

Steiner’s  Theorem  is,  by  the  way,  a  beautiful  theorem.  You  can  find  a  lot  about  it 
on  the  Web  by  doing  a  search  for  “Steiner  inellipse”  or  in  the  book  [2,  pp.  52-53]. 

o 

For  other  (not  necessarily  independent)  views  on  writing  see  [42],  [62],  and 

[112]. 


Chapter  10 

Relations 


In  the  last  chapter  we  introduced  relations.  We  will  now  look  at  three  useful  proper¬ 
ties  of  relations. 

Recall  that  “S  is  a  relation  on  a  set  X ”  is  one  way  of  saying  that  S  is  a  subset  of 
X  x  X,  and  therefore  the  elements  of  S  are  ordered  pairs,  (x.y).  Many  authors  write 
x  ~  y  rather  than  (x.y)  £  S.  Sometimes  we  will  write  x  ~  y  and  other  times  we  will 
write  (x,y)  £  S,  and  this  is  exactly  the  same  thing.  So  why  do  it?  Because  sometimes 
one  notation  is  more  convenient  than  the  other.  Use  the  next  exercise  to  familiarize 
yourself  with  both  notations. 

Exercise  10.1.  Let  S  =  {(x,y)  £  R  x  R.  :  x  >  y}. 

(a)  Sketch  the  set  S. 

(b)  With  this  relation  is  1  ~  2? 

(c)  With  this  relation  is  3.5  ~  2?  O 

At  this  point,  we  investigate  one  more  relation  that  will  be  familiar  to  you. 

Exercise  10.2.  Let  X  be  a  nonempty  set.  We  consider  the  usual  set  inclusion  C  on 
the  set  ^(X). 

(a)  Using  the  ordered  pair  notation  and  calling  this  relation  R,  describe  the  set  R. 

(b)  Give  an  example  of  a  set  X  and  two  sets  A,  B  £  PXiX)  such  that  A  C  B. 

(c)  Give  an  example  of  a  set  X  and  two  different  sets  A,B  £  £P(X)  such  that 
A  (/LB. 

(d)  If  A,  B  £  8?(X)  with  A  B,  can  you  conclude  that  A  C  B  or  B  C  A?  O 

You  will  learn  more  about  the  relation  in  Exercise  10.2  above  in  Chapter  13.  We 
now  turn  to  another  important  type  of  relation. 

A  relation  on  a  set  X  is  said  to  be  reflexive  if  x  ~  x  for  all  x  £X.  The  relation 
is  symmetric  if  for  all  x,y  £  X,  whenever  x  ~  y,  then  y  ~  x.  Finally,  the  relation  is 
transitive  if  for  all  x,y,z  £  X,  if  x  ~  y  and  y  ~  z,  then  x  ~  z.  If  a  relation  is  reflexive, 
symmetric,  and  transitive,  then  the  relation  is  said  to  be  an  equivalence  relation. 
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For  the  remainder  of  this  chapter,  we  will  look  at  examples  and  nonexamples  of 
equivalence  relations.  There’s  something  to  note  before  we  begin.  To  show  that  a 
relation  is  reflexive,  we  show  that  x  ~  x  for  all  x  G  X.  But  to  show  it  is  symmetric, 
we  must  choose  two  arbitrary  elements  of  X,  suppose  that  x  ~  y,  and  then  show 
that  y  ~  x.  If  we  don’t  use  the  fact  that  x  ~  y,  we  probably  haven’t  done  it  correctly. 
Finally,  to  show  that  a  relation  is  transitive,  we  must  choose  three  arbitrary  elements, 
suppose  that  x  ~  y  and  y  ~  z,  and  then  show  that  x  ~  z.  Remember  to  use  your 
assumptions  to  show  that  a  relation  is  symmetric  or  transitive. 

Example  10.3.  Define  a  relation  on  the  real  numbers  R  by  x  ~  y  if  and  only  if 
x  —  y  G  Z.  Show  that  this  relation  is  an  equivalence  relation. 

Before  we  begin  to  show  that  this  is  an  equivalence  relation,  we  will  do  appropri¬ 
ate  things  to  understand  this  definition.  Here  are  a  few  examples  of  pairs  that  satisfy 
the  relation: 


3~4,  0  ~  —2384,  7  ~7,  n~n  +  l,  -3.7  ~  4.3. 

On  the  other  hand,  the  following  pairs  do  not  satisfy  the  relation: 

3/3.5,  0/7 t,  -3.7/ 3.7. 

If  you  have  a  sense  of  what  the  relation  does,  you  are  ready  to  move  on  to  the 
proof.  (If  you  don’t  have  a  sense  of  what  is  happening,  look  for  more  examples  and 
nonexamples.) 

Proof.  To  show  that  this  relation  is  reflexive,  let  x  G  R.  Then  x  —  x  =  0.  Since  OsZ, 
we  see  that  x  —  x  G  Z.  Therefore,  x  ~  x  for  all  x  G  R  and  ~  is  reflexive. 

To  show  that  this  relation  is  symmetric,  let  x,y  G  R.  If  x  ~  y,  then  x—yG  Z.  But 
y  —  x  =  —  (x  —  y)  G  Z,  and  therefore  y  ~  x.  Hence  this  relation  is  symmetric. 

To  show  that  this  relation  is  transitive,  let  x,y,z  G  R.  If  x  ~  y  and  v  ~  z,  then 
x  —  yG  Z  and  y  —  z  G  Z.  Now  the  sum  of  two  integers  is  an  integer  and  therefore 
x  —  z=  (x  —  y)  +  (y  —  z)  G  Z.  In  other  words,  x  ~  z.  Thus  x  ~  z  is  transitive.  □ 

Exercise  10.4.  Let  R  be  a  relation  on  a  set  X.  Write  the  definition  of  each  of  the 
following  in  symbolic  notation:  R  is  reflexive,  symmetric,  and  transitive  on  X .  Once 
you  have  completed  this  task,  negate  each  of  your  (symbolic)  definitions.  O 

In  Example  10.3  it’s  interesting  to  try  to  describe,  in  words,  the  set  of  numbers 
that  are  related  to  0,  1/2,  n,  and  x.  As  is  the  case  with  our  examples  and  nonexam¬ 
ples  appearing  above,  we  hope  that  describing  these  sets  will  help  us  to  more  fully 
understand  this  relation. 

For  0,  we  look  for  {x€R:x~0}  =  {r£R:r- 0  GZ}.  Thus,  the  set  of  elements 
related  to  0  is  just  Z. 

For  1  /2,  we  look  for  {;r  G  R  :  x  ~  1  /2}  ={x£R:x  —  1/2  £  Z}.  Thus,  the  set  of 
all  elements  related  to  1/2  is  the  set  {1/2  +  k  :  k  GZ}. 
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For  71,  we  look  for  {x  £M.:  x  —  7t  £Z}.  Thus,  the  set  of  all  elements  related  to  n 
is  the  set  {n  +  k  :  k  £  Z}. 

In  general  it  appears  that  for  every  igR,  the  set  of  all  elements  related  to  x  is 
the  set  {x  +  k  :  k  £  Z}. 

Once  we  have  an  equivalence  relation  on  a  set  X,  we  define  the  equivalence 
class  of  an  element  x  €  X  to  be  the  set  Ex  where  Ex  =  {y  £  X  :  x  ~  y}.  It  might 
help  to  think  of  equivalence  classes  as  houses. “Being  in  the  household  of”  is  the 
equivalence  relation  on  the  people  in  the  town  and  the  houses  form  the  equivalence 
classes.  Saying  the  relation  is  reflexive  is  the  same  as  saying  each  person  lives  in  the 
same  house  as  herself.  The  fact  that  the  relation  is  symmetric  implies  that  if  Barbara 
lives  in  the  same  house  as  Bob,  then  Bob  lives  in  the  same  house  as  Barbara.  Finally, 
the  fact  that  the  relation  is  transitive  implies  that  if  Louis  lives  in  the  same  house  as 
Lois  and  Lois  lives  in  the  same  house  as  Lilly,  then  Louis  and  Lilly  live  in  the 
same  house.  If  David,  Esther,  Florian,  and  Gregg  all  live  in  the  house  and  David 
is  your  best  friend,  you’ll  probably  call  the  house  David’s  house.  If  Esther  is  your 
best  buddy,  though,  you  might  call  it  Esther’s  house.  No  matter  what  you  call  the 
house,  it’s  the  same  house  and  the  same  four  people  live  there.  And  these  are  the 
main  things  to  remember:  equivalence  classes  are  nonempty,  x  is  always  in  the  same 
class  as  x,  if  x  is  in  the  same  class  as  y,  then  y  is  in  the  same  class  as  x,  and  if  x  is 
in  the  same  class  as  y  and  y  is  in  the  same  class  as  z,  then  x  and  z  are  in  the  same 
class.  When  x  and  y  are  in  the  class,  you  might  call  the  class  Ex  or  you  might  call  it 
Ey\  either  way,  it  will  be  the  same  class,  as  we  will  show  more  precisely  below.  It’s 
time  for  a  little  practice  with  this  new  definition. 

Using  the  equivalence  class  notation,  we  see  that  the  sets  of  points  related  to  0, 
1/2,  it,  and  x  following  Example  10.3  were  actually  descriptions  of  Eq .  E^/2,  En, 
and  Ex. 

Exercise  10.5.  Write  each  relation  below  using  set  notation.  Then  decide  whether 
or  not  the  following  relations  are  reflexive,  symmetric,  or  transitive.  If  they  are  all 
three,  prove  it  and  describe  the  equivalence  classes.  If  they  are  not,  give  a  particular 
example  to  show  why  the  property  fails  to  hold. 

(a)  Define  a  relation  on  Z  by  x  ~  y  if  and  only  if  x  =  —y. 

(b)  Define  a  relation  on  Z  by  x  ~  y  if  and  only  if  x  —  y  is  even. 

(c)  Define  a  relation  on  Z  x  (Z  \  {0})  by  (x,y)  ~  (w,z)  if  and  only  if  xz  =  yw.  O 


Example  10.6.  We  look  at  /?  =  {(1, 1),  (2,2),  (3,3),  (1,3)}.  This  is  a  relation  on  the 
set  X  =  {1,2,3}.  We  can  represent  this  relation  with  a  diagram.  Such  a  diagram  is 
obtained  using  a  point  for  each  element  of  X,  and  these  will  be  the  vertices  of  the 
diagram.  If  (x,y)  £  R,  then  we  will  sketch  an  arrow  from  x  to  y.  These  arrows  are  the 
directed  edges  of  the  diagram  (that  is,  they  are  edges  with  an  indicated  direction). 
Figure  10.1  shows  the  diagram  of  our  relation  R.  (Such  a  diagram  is  usually  called 
a  directed  graph  or  a  digraph  in  graph  theory.  Since  we  will  soon  use  the  term  graph 
for  something  else,  we  will  call  such  a  picture  a  diagram.)  Q 
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Fig.  10.1  Diagram  of  a  relation  on  {1,2,3} 


Exercise  10.7.  Give  a  detailed  description  of  a  diagram  that  came  from  an  equiva¬ 
lence  relation;  that  is,  give  precise  conditions  that  a  diagram  must  satisfy  in  order 
to  fulfill  each  of  the  conditions  an  equivalence  relation  must  satisfy.  How  can  you 
identify  the  equivalence  classes  in  the  diagram?  O 


Definitions 

Definition  10.1.  A  relation  ~  on  a  set  X  is  reflexive  if  x  ~  x  for  all  x  GX. 

Definition  10.2.  A  relation  ~  on  a  set  X  is  symmetric  if  for  all  x7y  G  X,  whenever 
x  ~  y,  then  y  ~  x. 

Definition  10.3.  A  relation  ~ona  set  X  is  transitive  if  for  all  x.y.z.  G  X,  whenever 

x  ~  y  and  y  ~  z,  then  x  ~  z. 

Definition  10.4.  A  relation  on  a  set  X  is  an  equivalence  relation  if  it  is  reflexive, 
symmetric,  and  transitive. 

Definition  10.5.  Given  an  equivalence  relation  ~  on  a  set  X,  the  equivalence  class 
of  x  G  X  is  the  set  Ex  =  {y  G  X  ;  x  ~  y} . 


Solutions  to  Exercises 


Solution  (10.1).  The  set  S  is  represented  by  the  shaded  area  of  Figure  10.2. 

Using  the  defined  relation,  (1,2)  ^  S  but  (3.5,2)  G  S.  Thus,  the  answer  to  (b)  is 
no,  and  the  answer  to  (c)  is  yes. 

Solution  (10.2). 

(a)  The  relation  is  R  =  {(A,Z?)  G  Z?(X)  x  £P(X)  :  A  C  B}. 
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Fig.  10.2  {(x,y)  6  IK  x  R  :  x  >  y} 


(b)  For  our  example,  we  may  take  X  to  be  any  nonempty  set.  Then  0  and  X  are 
different,  both  are  in  2P(X),  and  0  C  X. 

(c)  Using  the  same  sets  as  above,  we  have  X  (jL  0. 

(d)  In  general,  we  cannot  conclude  from  A,  B  G  LXiX)  with  B  that  A  C  B  or 
B  C  A.  To  see  this,  let  {a,b}  C  X  with  a  ^  b.  Then  {«},{£>}  G  £P{X)  and 
{a}  ^  {£>}.  But,  {a}  jL  {b)  and  {/?}  ^  {a}.  However,  the  conclusion  does 
hold  if  X  contains  only  one  element.  In  that  case  XX  (X)  =  {0.  A" }  and  0  C  X. 

Solution  (10.4).  Since  R  is  a  relation  on  the  set  X,  we  have  R  C  X  x  X.  The  universe 
for  all  variables  below  is  the  set  X.  Then, 

R  is  reflexive,  if  Vx,  (x,x)  G  R. 

R  is  symmetric,  if  Vx,Vy,  ((x,y)  G  R  — >  (y,x)  G  R). 

R  is  transitive,  if  Vx,  Vy,Vz,  (((x,y)  G  R  A  ( y,z )  G  R)  — >  (x,z)  G  R). 

R  is  not  reflexive,  if  3x,  (x,x)  jL  R. 

R  is  not  symmetric,  if  3x,  3 y,  ((x,y)  G  R  A  (y,x)  jLR). 

R  is  not  transitive,  if  3x,  3y,  3  z,  ((x,y)  GRA(y,z)  GR  A  (x,z)  ^  R). 

Solution  (10.5). 

(a)  This  relation  is  neither  reflexive  nor  transitive  (but  it  is  symmetric).  It  is  not 
reflexive  because,  for  example,  1^—1  and  therefore  1  /  1 .  It  is  not  transitive 
because  1  ~  —  1  and  —  1  ~  1,  but  1^1. 

(b)  This  relation  is  an  equivalence  relation.  To  see  this,  let  x  G  Z.  Then  x  — x  =  0 
and  therefore  x  ~  x,  which  shows  that  ~  is  reflexive.  To  show  symmetry, 
let  x,y  G  Z.  If  x  ~  y,  then  x  —  y  is  even.  Since  y  —  x  =  —(x  —  y),  it  follows 
that  y  —  x  is  even.  Therefore  y  ~  x,  and  ~  is  symmetric.  For  transitivity,  let 
x,y,z  G  Z.  If  x  ~  y  and  y  ~  z,  then  x  —  y  and  y  —  z  are  both  even;  in  other 
words,  there  exist  integers  m  and  n  such  that  x  —  y  —  2m  and  y  —  z  =  2n.  Now, 
x  —  z  =  (x  —  y)  +  (y  —  z)  =  2m  +  2 n  =  2 (in  +  n),  and  m  +  n  G  Z.  Thus  x  —  z  is 
even.  Therefore  x  ~  z,  and  ~  is  transitive.  We  conclude  that  the  relation  is  an 
equivalence  relation. 
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What  are  the  equivalence  classes?  You  can  check  that  if  x  is  even,  then  the 
equivalence  class  corresponding  to  x  is  the  set  of  even  numbers.  If  x  is  odd, 
then  the  equivalence  class  corresponding  to  x  is  the  set  of  odd  numbers. 

(c)  This  relation  is  an  equivalence  relation. 

If  (x,y)  eZx(Z  \  {0}),  then  xy  =  yx.  Therefore,  (x,y)  ~  (x,y).  Thus,  the 
relation  is  reflexive. 

If  (x,y),  (w,z)  G  Z  x  (Z\  {0})  and  ( x,y )  ~  (w,z),  then  xz  =  yw.  This  implies 
that  wy  =  zx.  Thus,  (w,z)  ~  (x,y).  This  shows  that  the  relation  is  symmetric. 
Suppose  (x,y),(u,v),  and  (w,z)  are  elements  of  Z  x  (Z\{0})  such  that 
(x,y)  ~  ( u,v )  and  (u,v)  ~  (w,z).  By  definition,  y,v,  and  z  are  all  nonzero, 
xv  =  yu,  and  uz,  =  vw.  We  need  to  show  that  xz  =  yw.  Multiplying  both  sides 
of  the  equation  xv  =  yu  by  z  and  both  sides  of  the  equation  uz  =  vw  by  y, 
we  obtain  the  two  equations  xvz  =  yuz  and  uzy  =  vwy.  Therefore  xvz  =  vwy. 
Now  v  is  nonzero,  so  we  may  cancel  to  obtain  xz  =  yw,  which  shows  that 
(x,y)  ~  (w,z).  Therefore  ~  is  transitive. 

Finally,  =  {(w,z)  G  Z  x  (Z  \  {0})  :  xz  =  yw}.  If  we  think  of  (x,y)  as 
the  rational  number  x/y,  then  ( w,z )  ~  (x,y)  if  and  only  if  xjy  =  w/z-  So  this 
relation  is  a  way  of  identifying  all  fractions  with  the  same  value. 

Solution  (10.7).  A  relation  is  reflexive  if  and  only  if  its  diagram  has  an  arrow  from 
each  vertex  to  itself.  It  is  symmetric  if  and  only  if  whenever  we  have  an  arrow 
between  two  different  vertices,  then  there  is  also  an  arrow  in  the  other  direction 
between  these  two  vertices.  Finally,  a  relation  is  transitive  if  and  only  if  whenever 
an  arrow  goes  from  a  vertex  x  to  a  vertex  y  and  a  second  arrow  from  vertex  y  to  a 
vertex  z,  then  there  is  an  arrow  directly  from  x  to  z.  The  relation  is  an  equivalence 
relation  on  the  set  of  the  vertices  of  the  diagram,  if  the  diagram  satisfies  the  three 
conditions  outlined  above. 

Two  vertices  are  in  the  same  equivalence  class  if  and  only  if  there  is  an  arrow 
between  them.  Thus  two  vertices  are  in  different  equivalence  classes  if  and  only  if 
there  is  no  arrow  between  them.  In  a  diagram  representing  an  equivalence  relation, 
we  call  a  subset  of  the  vertices  with  the  property  that  any  two  vertices  of  the  sub¬ 
set  are  connected  by  arrows  a  component  of  the  diagram.  With  this  language,  the 
equivalence  classes  are  the  components  of  the  diagram. 


Problems 


Problem  10.1.  We  define  several  relations  on  R2  below.  Prove  that  each  is  an  equiv¬ 
alence  relation  and  give  a  geometric  description  of  the  equivalence  classes. 

(a)  We  define  (x,y)  ~  (w,z)  if  and  only  if  x+y  =  w  +  z. 

(b)  We  define  (x,y)  ~  (w,z)  if  and  only  if  x2  +y2  =  w2  +  z2. 

(c)  We  define  (x,y)  ~  (w,z)  if  and  only  if  3 x+y  =  3 w  +  z. 

(d)  We  define  (x,y)  ~  (w,z)  if  and  only  if  x  =  w. 
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Problem  10.2.  Decide  whether  or  not  the  following  relations  are  reflexive,  sym¬ 
metric,  or  transitive.  If  a  property  holds,  prove  that  it  does.  If  a  property  does  not 
hold,  prove  that  it  does  not  hold.  If  the  relation  is  an  equivalence  relation,  give  the 
equivalence  class  of  a  general  point  x  £  X. 

(a)  On  R,  we  define  x  ~  y  if  and  only  if  x  <  y. 

(b)  On  R,  we  define  x  ~  y  if  and  only  if  x  <  y. 

(c)  On  Z,  we  define  x  ~  y  if  and  only  if  x  —  y  is  divisible  by  3. 

(d)  If  X  is  a  nonempty  set,  define  a  relation  on  (X )  by  ,4  ~  B  if  and  only  if 

ACB. 

(e)  If  X  is  a  nonempty  set,  define  a  relation  on  £P(X)  by  A  ~  B  if  and  only  if 
A\By^  0. 

(f)  On  Z,  we  define  x  ~  y  if  and  only  if  |jc|  =  |y|. 

(g)  On  Z+,  we  define  x  ~  y  if  and  only  if  there  exists  a  rational  number  m  such 
that  x  =  ym. 


Problem  10.3.  Define  three  relations  on  R  by  x  ~  y  if  and  only  if  there  exists  n  £  Z 
such  that 

(a)  x,y  £  [n.n  +  1]; 

(b)  x,y  £  [n,n  +  1); 

(c)  x,y  £  [n,n  +  2). 

In  each  case,  determine  whether  we  have  defined  an  equivalence  relation  or  not. 
Give  reasons  for  your  answers. 

Problem  10.4.  If  a,b  £  C,  say  that  a  ~  b  if  and  only  if  ak  =  bk  for  some  positive 
integer  k.  Prove  that  this  is  an  equivalence  relation. 

Problem  10.5.  We  define  a  relation  ~  on  R2  by  (x \ , y | )  ~  (X2.V2)  if  and  only  if 
yo  —  y  1  £  2Z  (see  Example  6.1  for  the  definition  of  2Z).  Show  that  the  relation  ~  is 
an  equivalence  relation  and  describe  the  equivalence  class  of  the  point  (0,1). 

Problem  10.6.  Define  an  equivalence  relation  on  Z  that  has  exactly  three  equiva¬ 
lence  classes. 

Problem  10.7.  LetX  =  {1,2, 3, 4, 5}. 

(a)  If  possible,  define  a  relation  on  X  that  is  an  equivalence  relation. 

(b)  If  possible,  define  a  relation  on  X  that  is  reflexive,  but  neither  symmetric  nor 
transitive. 

(c)  If  possible,  define  a  relation  on  X  that  is  symmetric,  but  neither  reflexive  nor 
transitive. 

(d)  If  possible,  define  a  relation  on  X  that  is  transitive,  but  neither  reflexive  nor 
symmetric. 

Problem  10.8.  Define  a  relation  ~  on  R  as  follows:  For  x,y  £  R,  we  say  x  ~  y  if 
and  only  if  x2  —y2  £  Z. 
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(a)  Prove  that  ~  as  defined  above  is  an  equivalence  relation  on  R. 

(b)  Give  five  different  real  numbers  that  are  in  the  equivalence  class  E 

Problem  10.9.  Define  a  relation  ~  on  E2  as  follows:  For  (xi  ,X2),  (>’i  ,^2)  £  R2.  we 
say  that  (jq  ,X2)  ~  (yi  ,^2)  if  and  only  if  both  xi  —  y\  and  X2  —  yi  are  even  integers. 
Is  this  relation  an  equivalence  relation?  Why  or  why  not? 

Problem*  10.10.  Let  X  be  a  nonempty  set  with  an  equivalence  relation  ~  on  it. 
Prove  that  for  all  elements  x  and  y  in  X,  the  equality  Ex  =  Ey  holds  if  and  only  if 

x~y. 

Problem  10.11.  What,  if  anything,  is  wrong  with  the  following  argument? 

We  claim  that  if  a  relation  on  a  set  X  is  symmetric  and  transitive,  then  it  is  reflex¬ 
ive.  Here’s  a  proof  of  this  claim: 

Proof.  Let  x  £  X.  Let  y  £  X  with  x  ~  y.  By  symmetry  we  have  y  ~  x.  We  now  use 
transitivity  to  conclude  that  x  ~  x.  0 

Problem  10.12.  Give  an  example  of  a  relation  on  Z  x  Z  that  is  not  transitive,  but  is 
reflexive  and  symmetric. 

Problem  10.13.  Recall  that  a  polynomial  p  over  R  is  an  expression  of  the  form 

p(x)  =  a„xn  +an~ ix"_1  H - baix1  +ao  where  each  aj  £  R  and  n  £  N.  The  largest 

integer  j  such  that  aj  f  0  is  the  degree  of  p.  We  define  the  degree  of  the  constant 
polynomial  p  =  0  to  be  —  00.  (A  polynomial  over  M  defines  a  function  p  :  R  — >  R.) 

(a)  Define  a  relation  on  the  set  of  polynomials  by  p  ~  q  if  and  only  if  p( 0)  =  q(0). 
Is  this  an  equivalence  relation?  If  so,  what  is  the  equivalence  class  of  the 
polynomial  given  by  p{x)  =  x? 

(b)  Define  a  relation  on  the  set  of  polynomials  by  p  ~  q  if  and  only  if  the  degree 
of  p  is  the  same  as  the  degree  of  q.  Is  this  an  equivalence  relation?  If  so,  what 
is  Er  if  r(x)  =  3x  +  5? 

(c)  Define  a  relation  on  the  set  of  polynomials  by  p  ~  q  if  and  only  if  the  degree 
of  p  is  less  than  or  equal  to  the  degree  of  q.  Is  this  an  equivalence  relation?  If 
so,  what  is  Er,  where  r(x)  =  x2? 

Problem  10.14.  Figure  10.3  shows  the  diagram  of  a  relation  onl  =  {a,b,c,d,e,f}. 
Is  this  the  diagram  of  an  equivalence  relation  on  XI  Give  reasons  for  your  answer. 
If  it  is  an  equivalence  relation,  find  all  equivalence  classes  of  the  relation. 

Problem  10.15.  We  define  a  relation  on  a  subset  X  of  Z  as  follows.  For  x,y  £  X, 
x  ~  y  if  and  only  if  there  is  a  prime  number  p  such  that  p\x  and  p\y.  For  each  of  the 
two  choices  of  X  below,  draw  a  diagram  of  the  relation.  From  the  diagram  determine 
whether  or  not  the  relation  on  X  is  an  equivalence  relation.  If  it  is  an  equivalence 
relation,  use  the  diagram  to  find  all  equivalence  classes.  How  many  are  there? 

(a)  X={x£Z:l<x<  10}; 

(b)  X  =  {4,5,6,8,11,12,35,143}. 
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Fig.  10.3 


Problem  10.16.  Prove  that  if  x  and  y  are  two  vertices  of  a  diagram  that  represents 
an  equivalence  relation  on  a  set  X  and  x  is  in  the  same  equivalence  class  as  y,  then 
there  are  arrows  pointing  in  both  directions  between  x  and  y. 

Problem  10.17.  We  say  that  a  relation  ~  on  a  set  X  is  antisymmetric  if  for  all 
x,y  £  X,  whenever  x  ~  y  and  y  ~  x,  then  x  =  y.  How  can  you  see  from  the  di¬ 
agram  of  a  relation  that  it  is  antisymmetric?  Explain.  Draw  a  nontrivial  exam¬ 
ple  (at  least  one  arrow  between  two  different  vertices)  of  such  a  diagram  using 
X  =  {0,1,2, 3, 4, 5, 6, 7, 8, 9}. 

Problem#  10.18.  Let  *  be  a  nonempty  set  and  suppose  that  there  is  an  equivalence 
relation  defined  on  *.  Prove  that  the  union  of  all  equivalence  classes  is  *. 

Problem**  10.19.  Given  an  equivalence  relation  on  a  nonempty  set  X,  prove  that  no 
equivalence  class  is  empty. 

Problem  10.20.  Given  three  sets  A.B.  and  C  we  can  define  a  3-adic  relation  among 
them  in  the  following  way.  We  first  define  the  triple  product  of  the  sets  A .  B,  and 
CbyAx5xC={(a,i)1c)  :  a  £  A,b  £  B,  and  c  £  C}.  This  notation  presumes  that 
(ai,fci,ci)  =  (02^2,02)  if  and  only  if  a\  =  a.2,  b\  =  b2,  and  c\  =  C2-  A  3-adic 
relation  among  sets  A.B,  and  C  is  a  subset  of  Ax  B  xC.  Give  a  meaningful  example 
of  a  3-adic  relation  on  A  =  B  =  C  =  Z.  Give  two  triples  that  are  related  in  your 
example  and  two  triples  that  are  not  related. 
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Tips  on  Reading  Mathematics 


Don’t  just  read  it;  fight  it!  Ask  your  own  questions,  look  for  your  own  examples,  discover 
your  own  proofs.  Is  the  hypothesis  necessary?  Is  the  converse  true?  What  happens  in  the 
classical  special  case?  What  about  the  degenerate  cases?  Where  does  the  proof  use  the 
hypothesis? — Paul  R.  Halmos,  [44] 

•  Be  an  active  reader.  Open  to  the  page  you  need  to  read,  get  out  some  paper  and  a 
pencil. 

•  If  notation  is  defined,  make  sure  you  know  what  it  means.  Your  pencil  and  paper 
should  come  in  handy  here. 

•  Look  up  the  definitions  of  all  words  that  you  do  not  understand. 

•  Read  the  statement  of  the  theorem,  corollary,  lemma,  or  example.  Can  you  work 
through  the  details  of  the  proof  by  yourself?  Try.  Even  if  it  feels  like  you  are 
making  no  progress,  you  are  gaining  a  better  understanding  of  what  you  need  to 
do. 

•  Once  you  truly  understand  the  statement  of  what  is  to  be  proven,  you  may 
still  have  trouble  reading  the  proof — even  someone’s  well-written,  clear,  con¬ 
cise  proof.  Try  to  get  the  overall  idea  of  what  the  author  is  doing,  and  then  try 
(again)  to  prove  it  yourself. 

•  If  a  theorem  is  quoted  in  a  proof  and  you  don’t  know  what  it  is,  look  it  up.  Check 
that  the  hypotheses  apply,  and  that  the  conclusion  is  what  the  author  claims  it  is. 

•  Don’t  expect  to  go  quickly.  You  need  to  get  the  overall  idea  as  well  as  the  details. 
This  takes  time. 

•  If  you  are  reading  a  fairly  long  proof,  try  doing  it  in  bits. 

•  If  you  can’t  figure  out  what  the  author  is  doing,  try  to  (if  appropriate)  choose  a 
more  specific  case  and  work  through  the  argument  for  that  specific  case. 

•  Draw  a  picture,  if  appropriate. 

•  If  you  really  can’t  get  it,  do  what  comes  naturally — put  the  book  down  and  come 
back  to  it  later.  You  might  want  to  take  this  time  to  read  similar  proofs  or  some 
examples. 

•  After  reading  a  theorem,  see  if  you  can  restate  it.  Make  sure  you  know  what  the 
theorem  says,  what  it  applies  to,  and  what  it  does  not  apply  to. 

•  After  you  read  the  proof,  try  to  outline  the  technique  and  main  idea  the  author 
used.  Try  to  explain  it  to  a  willing  listener.  If  you  can’t  do  this  without  looking 
back  at  the  proof,  you  probably  didn’t  fully  understand  the  proof.  Read  it  again. 

•  Can  you  prove  anything  else  using  a  similar  proof?  Does  the  proof  remind  you 
of  something  else?  What  are  the  limits  of  this  proof?  This  theorem? 

•  If  your  teacher  is  following  a  book,  read  over  the  proofs  before  you  go  to  class. 
You’ll  be  glad  you  did. 

As  we  proceed,  you  will  have  plenty  of  opportunities  to  try  these  tips  out  and  find 

some  others  of  your  own. 


Chapter  11 

Partitions 


It  is  sometimes  helpful  to  split  a  nonempty  set  into  disjoint  smaller  pieces.  For  ex¬ 
ample,  we  might  have  reason  to  split  the  integers  into  positive  integers,  negative 
integers,  and  the  set  containing  zero  alone.  We  often  split  the  real  numbers  into  ra¬ 
tional  numbers  and  irrational  numbers,  or  we  might  want  to  break  R2  down  into 
distinct  vertical  lines.  All  of  these  are  examples  of  partitioning  a  space. 

Though  we  have  an  intuitive  feel  for  what  a  partition  is,  we  need  a  precise  defi¬ 
nition.  We  turn  to  that  now.  A  partition  of  a  nonempty  set  X  is  a  collection  sif  of 
subsets  of  X  that  satisfies  the  following  three  conditions. 

(i)  Every  set  A  £  si  is  nonempty, 

(ii)  lUe.^A=X’and 

(iii)  for  all  A,B  £  if  ARB  ^  0,  then  A  =  B. 

Looking  back  at  the  example  of  the  integers  that  we  discussed  earlier,  we  see  that 
the  collection  srf  =  {Z+,  Tr ,  {0}}  is  a  partition  of  Z. 

Figure  11.1  provides  a  diagram  of  a  partition  ofX  =  {a.b.c.d.e.f}  intosetsAi, 
A 2,  and  A3,  defined  by 

Ai  =  {a,b},  Ai  =  {c,d,e},  and  A3  =  {/}. 

While  it  is  often  clear  that  the  sets  in  the  collection  are  nonempty,  you  should 
still  check.  Condition  (ii)  says  that  every  element  of  X  is  in  at  least  one  of  the  sets  in 
the  collection.  It’s  a  sort  of  existence  statement:  for  each  element  x  of  X,  there  exists 
a  set  A  in  srf  of  which  x  is  a  member.  The  third  condition  is  a  fancy  way  of  saying 
that  two  of  the  sets  in  the  partition  are  disjoint  or  they  are  equal.  This  is  a  sort  of 
uniqueness  statement:  if  x  belongs  to  two  sets  A  and  B,  then  the  sets  must  be  equal. 

We  hope  the  examples  below  will  clarify  these  concepts. 

Example  11.1.  For  each  x  £  R,  define  Ax  =  {_y  €  R  :  \x\  =  [v }.  We  will  show  that 
{Ax  :x£t}  forms  a  partition  of  R. 

Before  we  begin,  note  that  our  proposed  partition  is  an  indexed  collection  of  sets, 
and  that  in  this  case  it  is  possible  that  two  different  indices  give  rise  to  the  same  set. 

U.  Daepp  and  P.  Gorkin,  Reading,  Writing,  and  Proving:  A  Closer  Look  at  Mathematics ,  111 

Undergraduate  Texts  in  Mathematics,  DOI  10.1 007/978- 1  -44 1 9-9479-0  1 1 , 

©  Springer  Science+Business  Media,  LLC  20 1 1 


112 


1 1  Partitions 


Fig.  11.1  A  partition  of  { a,b,c,d,e,f } 


For  example,  A i  =  {1,-1}  and  A-i  =  {—  1, 1},  so  A  \  DA_i  ^  0.  That  will  not  be  a 
problem  though,  and  it  does  illustrate  why  we  stated  condition  (iii)  the  way  we  did. 
It  happens  that  Ai  flA-i  ^  0,  but  A]  =A_i,  as  required. 

Proof.  We  note  that  all  the  sets  Aa  are  nonempty.  So  there  are  two  things  left  to 
show,  namely,  conditions  (ii)  and  (iii)  of  the  definition  of  partition.  We  begin  by 
showing  that  (ii)  holds;  that  is,  UxgR^c  =  R-  First,  UvgR^v  Q  R  since  each  Ax  C  R. 
We  show  the  reverse  containment  using  an  element-chasing  argument.  Let  y  G  R. 
Then  |y |  =  |y|,  so  y  G  Ay.  Since  y  G  Ax  for  some  x  (namely,  x  =  y),  we  may  conclude 
that  y  G  U.«ek  A-v  Thus  UeM =  R- 

Next,  suppose  that  Ax  IT  Ay  /  0.  Then  we  must  show  that  Ax  =  Ay.  By  our  as¬ 
sumption,  there  exists  z  G  R  such  that  z  G  A*  DAV.  Therefore,  |x|  =  |z|  and  |y|  =  |z|. 
In  particular,  \x\  =  |y|.  So, 

Ax  =  {w  G  R  :  \x\  =  |w|}  =  {w  G  R  :  |y|  =  |w|}  =Ay.  □ 

Well,  we  showed  the  two  sets  Ax  and  Ay  are  equal  with  nary  an  element-chasing 
argument  in  sight.  What  happened?  We  certainly  could  have  started  with  an  element 
from  one  side  and  showed  it  was  in  the  other,  switched  sides,  repeated  what  we  did, 
and  then  concluded  we  were  done.  But  this  is  somewhat  cumbersome  and  doesn’t 
show  us  what  is  really  going  on.  So  from  now  on,  even  though  we  can  use  element¬ 
chasing,  we  are  going  to  use  whatever  produces  the  most  elegant  or  enlightening 
proof. 

Exercise  11.2.  For  each  n  G  N,  let  A„  =  [—n,n].  Show  that  the  collection  srf  =  {A„  : 
n  G  N}  does  not  form  a  partition  of  R.  However,  if  we  define  B„  =  [n,n+  1),  then 
the  collection  =  {Bn  :  n  G  2}  does  partition  R.  O 

In  Example  11.1  and  Exercise  1 1 .2,  the  third  condition  may  have  reminded  you 
of  transitivity.  If  so,  then  it  may  not  surprise  you  to  learn  that  there  is  a  connection 
between  equivalence  relations  and  partitions.  As  we  have  already  seen,  every  equiv¬ 
alence  relation  on  a  set  X  gives  rise  to  equivalence  classes  in  a  natural  way.  These 
equivalence  classes  are  sets  and  as  we  will  see,  these  sets  partition  our  set  X. 
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Conversely,  a  collection  of  sets  that  partitions  a  set  X  gives  rise  to  an  equiva¬ 
lence  relation  on  X.  How?  Well,  we  say  two  elements  in  X  are  related  if  they  belong 
to  the  same  set  of  the  partition.  We  shall  now  show  that  this  relation  is  an  equiva¬ 
lence  relation.  We  can  shorten  our  proof  of  this  theorem,  if  we  first  prove  something 
less  ambitious.  A  helpful  result  that  is  used  to  prove  a  theorem  is  called  a  lemma. 
Lemmas  are  sometimes  of  independent  interest. 

Lemma  11.3.  Let  X  be  a  nonempty  set  and  let  ~  be  an  equivalence  relation  on  X. 
For  tw’o  arbitrary  elements  x  and  y  in  X,  ifEx  (T  Ey  then  Ex  =  Ey. 

The  very  first  thing  we  should  probably  ask  ourselves  before  beginning  our  proof 
is:  What  is  EX1  If  we  don’t  know,  we  can’t  understand  the  proof.  So,  before  reading 
the  proof,  we  recall  the  definition: 

Ex  =  {z£X  :x~  z}. 


Now  the  proof  should  be  easy. 

Proof.  Let  z  £  Ex.  Hence  x  ~  z.  Since  we  assume  that  Ex  fi  Ey  f  0,  we  may  choose 
w  £  Ex  n  Ey.  Thus  w  G  Ex,  and  therefore  x  ~  w.  Similarly,  w  £  Ey  and  therefore 
y  ~  w.  By  symmetry,  w  ~  x.  So  y  ~  w,  w  ~  x,  and  x  ~  z.  By  transitivity,  y  ~  z.  Thus, 
z  £  Ey,  and  we  may  conclude  that  Ex  C  Ey. 

Exactly  the  same  argument  (*)  shows  that  Ey  C  Ex.  Hence  Ex  =  Ey.  □ 

One  comment  on  the  proof  above:  When  we  use  the  words  “exactly  the  same 
argument,”  as  in  (*),  that  means  nothing  would  be  changed  except  (possibly)  the 
symbols.  If  you  use  words  to  that  effect  (like  “similarly”  or  “exactly  as  above”), 
make  sure  that  what  you  say  is  true.  Now  that  we  have  our  lemma,  we  turn  to  the 
proof  of  our  main  theorem. 

Theorem  11.4.  Let  ~  be  an  equivalence  relation  on  a  nonempty  set  X.  Then  the 
indexed  collection  of  equivalence  classes  {Ex  :  x  £  X}  is  a  partition  ofX.  Further¬ 
more,  if  s/  is  a  partition  of  a  nonempty  set  X  and  for  x,y  £  X  we  define  x  ~  y  if  and 
only  ifx,y  £  A  for  some  A  £  sT ,  then  ~  is  an  equivalence  relation  on  X. 

Before  beginning  the  proof,  let’s  reflect  on  what  we  need  to  do.  For  the  first 
assertion  (“{Ex  :  x  £  X}  is  a  partition”)  we  need  to  show  that  the  sets  are  nonempty, 
and  satisfy  conditions  (ii)  and  (iii)  in  the  definition  of  partition. 

What  do  we  expect  to  use?  Our  assumptions,  of  course.  We  are  assuming  ~  is 
an  equivalence  relation,  so  we  should  use  the  fact  that  ~  is  reflexive,  symmetric, 
and  transitive.  But  that’s  only  one  direction — this  would  show  that  an  equivalence 
relation  gives  rise  to  equivalence  classes  and  these,  in  turn,  form  a  partition  of  our 
set. 

For  the  other  direction,  we  want  to  show  that  if  we  have  a  relation  defined  by 
a  partition  sf ,  then  the  relation  is  an  equivalence  relation.  So  that  means  we  must 
show  that  ~  is  reflexive,  symmetric,  and  transitive.  How  will  we  do  that?  Well, 
probably  the  first  thing  to  do  is  to  make  sure  we  know  what  ~  is.  Remember,  x  ~  y 
if  and  only  if  there  exists  A  £  srf  such  that  x,y  £  A.  Now,  finally,  we  may  begin. 
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Proof.  First  we’ll  show  that  given  an  equivalence  relation  on  X,  the  indexed  col¬ 
lection  of  sets  {Ex  :  x  G  X }  forms  a  partition  of  X.  We  first  show  that  each  Ex  is 
nonempty.  Since  the  relation  is  reflexive,  x  ~  x  for  each  x  G  X .  Thus  x  G  Ex  for  each 
x  GX,  and  Ex  0. 

Now  we  need  to  check  condition  (ii):  that  \JyeXEy  =X.  If  x  G  X,  then  we  have 
just  seen  that  x  G  Ex.  This  shows  that  x  G  U yexEy.  Thus  X  C  (J yexEy.  Since  the 
opposite  inclusion  follows  from  the  fact  that  Ey  C  X  for  each  y  G  X,  we  know  that 
X  =  Uve xEy-  Thus,  condition  (ii)  holds. 

To  show  that  condition  (iii)  holds,  suppose  that  forx.y  G  X,  we  have  ExnEy  ^0. 
By  Lemma  1 1 .3,  we  conclude  that  Ex  =  Ey,  and  condition  (iii)  holds.  Thus,  the  set  of 
equivalence  classes  {Ex  :  xG  X }  satisfies  conditions  (i),  (ii),  and  (iii)  and  therefore 
forms  a  partition  of  X. 

To  prove  the  converse,  suppose  that  sf  is  a  partition  of  X.  By  condition  (ii), 
X  =  U AesrfA.  Thus,  for  x  G  X,  there  exists  A  G  £/  such  that  x  G  A.  Since  x  is  in  the 
same  set  as  itself,  x  ~  x.  Since  x  was  arbitrary,  ~  is  reflexive. 

Suppose  now  that  x,y  GX  and  x  ~  y.  Then  there  exists  A  G  srf  such  that  x,y  GA. 
But  if  x,y  G  A,  then  y,x  G  A.  Consequently,  y  ~  x.  Therefore  ~  is  symmetric. 

Finally,  suppose  that  x.y.z  G  X  where  x  ~  y  and  y  ~  z.  We  must  show  that  x  ~  z. 
By  the  definition  of  ~  we  see  that  there  exists  A  G  ■(/  such  that  x,y  G  A,  and  there 
exists  B  G  srf  such  that  y,z  G  B.  Therefore,  A  n  B  f  0.  By  property  (iii)  of  partitions, 
A  =  B.  Thus  x,z  G  A.  Therefore,  x  ~  z,  as  desired.  We  conclude  that  the  partition 
gives  rise  to  an  equivalence  relation,  since  ~  is  symmetric,  transitive,  and  reflexive. 

□ 

We  will  illustrate  this  connection  between  partitions  and  equivalence  relations 
with  two  very  simple  examples. 

Example  11.5.  (a)  Consider  the  setX  =  {1,2, 3, 4, 5},  then  the  collection  of  sets 

=  {{1,2}, {3}, {4, 5}}  is  a  partition  of  X.  Describe  the  corresponding 
equivalence  relation. 

We  have  exactly  the  following  relations  (and  no  others): 

1~1,2~2,3~3,4~4,5~5,1~2,2~1,4~5,5~4 

By  Theorem  1 1.4,  this  relation  is  an  equivalence  relation. 

(b)  We  consider  the  set  Y  =  {1,2,3}  and  define  an  equivalence  relation  on  XX ( Y ) 
by  A  ~  B  if  and  only  if  the  number  of  elements  of  A  is  equal  to  the  number  of 
elements  of  B.  for  A.B  G  XX  (Y). 

This  is  clearly  an  equivalence  relation  and  Theorem  1 1 .4  shows  that  the  fol¬ 
lowing  collection  of  sets,  S3,  is  a  partition  of  !YX (V ): 

^  =  {{®}i  {{1},  {2},  {3}},  {{1,2},  {1,3},  {2,3}},  {{1,2,3}}}.  0 

Theorem  1 1 .4  shows  how  one  can  obtain  a  partition  from  an  equivalence  relation  and 
vice  versa.  Now  suppose  we  start  with  a  partition  sY\  and  use  the  theorem  to  get  a 
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corresponding  equivalence  relation.  Then  we  use  that  equivalence  relation  to  obtain 
a  second  partition  sA £.  What’s  the  relation  between  ,<//\  and  .ft/2  ?  And  what  about 
turning  this  procedure  around:  Say  we  start  with  an  equivalence  relation  R\ ,  and  we 
use  it  to  determine  a  partition.  Then  we  use  this  partition  to  obtain  an  equivalence 
relation  Ih-  What’s  the  relation  between  R\  and  AS?  You’re  probably  thinking  that  in 
both  cases  you  will  end  up  where  you  started.  If  you  work  Problems  1 1.14  and  11.15 
you  will  find  out  that  your  intuition  is  correct:  .ft/|  and  .tA  will  be  the  same,  and  so 
will  R\  and  AS-  We  say  that  Theorem  11.4  provides  a  one-to-one  correspondence 
between  the  equivalence  relations  on  a  nonempty  set  X  and  the  partitions  of  the  set 
X. 

Exercise  11.6.  For  r  £  R,  let  Ar  =  {(x,y)  £  R2  :  x  +  y  =  r}.  Show  that  {Ar :  r  £  R} 
is  a  partition  of  R2.  Then  describe  the  equivalence  relation  and  equivalence  classes 
associated  with  this  partition.  Q 


Definition 

Definition  11.1.  A  partition  of  a  nonempty  set  X  is  a  collection  .ft/  of  subsets  of 
X  that  satisfies  the  following  three  conditions. 

(i)  Every  set  A  £  srf  is  nonempty, 

(ii)  Uag^A  —X,  and 

(iii)  for  all  A,B  £  ^/,  if  AHB  ^  0,  then  A  =  B. 


Solutions  to  Exercises 

Solution  (11.2).  The  collection  ///  does  not  partition  R  because  condition  (iii)  is 
not  satisfied:  A 1  (TA2  /  0,  but  A\  /  At.  The  collection  38  does  partition  R:  For  each 
n  £  Z,  the  set  Bn  is  nonempty,  the  union  of  the  sets  satisfies 

(J  B=  |Jfln  =  IJ  \nAl  +  1)  =  R, 

BeSS  ne  z  «ez 

and  if  Bn,Bm  £  38  with  Bn  C\Bm  ^  0,  then  [n,n  +  1)  D  [m,m  +1)  ^0.  Since  m  and 
n  are  integers,  the  intervals  \n,n+  1 )  and  \m.  in  -f  1 )  are  either  equal  or  disjoint.  We 
conclude  that  [n,n+  1)  =  [111,111+  1);  in  other  words,  B„  =  Bm. 

Solution  (11.6).  Note  that  for  r  £  R,  the  ordered  pair  (0,r)  satisfies  the  condition 
0  +  r  =  r.  Thus  (0,  r)  £  A,-  and  A,-  is  nonempty.  Since  it  is  clear  that  UrGK  A,-  C  M2, 
we  check  the  reverse  inclusion.  If  (u,  v)  £  R2,  then  s  =  u  +  v  £  R  and  consequently 
(u,  v)  £  A^..  Thus  (n,  v)  £  [Jrei9tAr,  and  (Jr€®Ar  =  R2,  completing  the  proof  of  con¬ 
dition  (ii)  in  the  definition  of  partition.  Finally,  suppose  that  Ar  ft  As  /  0.  Then  there 
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exists  (u,  v;)  G  A,-  fl  A5.  By  the  definition  of  Ar  and  As  this  means  that  r  =  u  +  v  =  s. 
Thus,  Ar  =AS,  as  desired. 

We  associate  an  equivalence  relation  on  R2  as  follows.  For  (x,y),(u,v)  G  R2, 
we  will  say  (x,y)  ~  (h,v)  if  and  only  if  x  +  y  =  u  +  v.  By  our  work  above  and 
Theorem  1 1.4,  this  is  an  equivalence  relation  on  Si2.  The  equivalence  classes  are  the 
lines  with  slope  —1. 

In  the  two  exercises  in  this  chapter,  the  third  condition  (of  partition)  is  satisfied 
because  the  indices  (n  and  m  in  Exercise  11.2,  and  r  and  s  in  Exercise  11.6)  are 
equal.  Though  this  can  happen.  Example  11.1  shows  that  the  two  sets  can  be  equal 
without  the  indices  being  equal.  Condition  (iii)  in  the  definition  of  partition  requires 
that  we  show  that  the  two  sets  are  equal — not  the  two  indices. 


Problems 

Problem  11.1.  For  each  of  the  relations  in  Problem  10.2  that  you  determined  to  be 
equivalence  relations,  describe  the  partition  associated  with  it. 

Problem  11.2.  Determine  whether  or  not  the  following  are  equivalence  relations  on 
R2.  If  they  are,  describe  the  partition  associated  with  each: 

(a)  (x,y)  ~  (w,  z)  if  and  only  if  y  =  w, 

(b)  (x,y)  ~  (w,z)  if  and  only  if  x2  =  w2; 

(c)  (x,y)  ~  (w,  z)  if  and  only  if  xw  =  yz. 

Problem  11.3.  None  of  the  following  partitions  R.  Say  precisely  why  the  collection 
of  sets  fails  to  be  a  partition  of  R. 

(a)  {{x  G  R  :  |x|  =  r}  :  r  G  R}; 

(b)  {{x  G  R  :  n  <  |x|  <  n  +  1}  :  n  G  N}; 

(c)  {R\R+,R\R-}. 

Problem  11.4.  (a)  For  each  r  G  R,  let  A,-  =  {(x,y,z)  G  R3  :  x+y  +  z  =  r}.  Is  this 

a  partition  of  R3?  If  so,  give  a  geometric  description  of  the  partitioning  sets, 
(b)  For  each  r  G  R,  let  A,-  =  {(x,y.z)  G  R3  :  x2  +y2  +z2  =  r2}.  Is  this  a  partition 
of  R3?  If  so,  give  a  geometric  description  of  the  partitioning  sets. 

Problem  11.5.  (a)  Let  A  =  {1,2, . . . ,  10}.  Describe  a  partition  of  A  that  gives  rise 

to  five  distinct  partitioning  sets. 

(b)  Describe  a  partition  of  Z  that  gives  rise  to  five  distinct  partitioning  sets. 

(c)  Can  you  describe  a  partition  of  R  that  gives  rise  to  five  distinct  partitioning 
sets? 

Problem  11.6.  (a)  Suppose  that  we  partition  R3  into  horizontal  planes.  What 

equivalence  relation  is  associated  with  this  partition? 
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(b)  Suppose  that  we  partition  BP  into  concentric  spheres,  centered  at  (0,0,0). 
What  equivalence  relation  is  associated  with  this  partition? 

Problem  11.7.  Suppose  that  we  look  at  the  set  X  containing  all  circles  in  the  plane. 
Define  an  equivalence  relation  on  this  set  of  circles  by  c  ~  d  if  and  only  if  the  circles 
c  and  d  have  the  same  center.  Describe  the  partition  associated  with  this  equivalence 
relation. 

Problem  11.8.  Consider  the  set  P  of  polynomials  with  real  coefficients.  Decide 
whether  or  not  each  of  the  following  collection  of  sets  determines  a  partition  of 
P.  If  you  decide  that  it  does  determine  a  partition,  show  it  carefully.  If  you  decide 
that  it  does  not  determine  a  partition,  list  the  part(s)  of  the  definition  that  is  (are) 
not  satisfied  and  justify  your  claim  with  an  example.  (See  Problem  10.13  for  more 
information  about  polynomials.) 

(a)  For  m  £  N,  let  Am  denote  the  set  of  polynomials  of  degree  m.  The  collection 
of  sets  is  {Am  :  m  £  N}. 

(b)  For  c  £  E,  let  Ac  denote  the  set  of  polynomials  p  such  that  p(0)  =  c.  The 
collection  of  sets  is  {Ac  :  c  £  E}. 

(c)  For  a  polynomial  q,  let  Aq  denote  the  set  of  all  polynomials  p  such  that  q  is  a 
factor  of  p\  that  is,  there  is  a  polynomial  r  such  that  p  =  qr.  The  collection  of 
sets  is  {Aq  :  q  £  Pj. 

(d)  For  c  £  E,  let  Ac  denote  the  set  of  polynomials  p  such  that  p{c)  =  0.  The 
collection  of  sets  is  {Ac  :  c  £  E}. 

Problem  11.9.  For  two  nonempty  disjoint  sets,  I  and  J,  let  { A „  :  a  £  1}  be  a  parti¬ 
tion  of  E+  and  {Aa  :  a  £  7}  be  a  partition  of  E~  U{0}.  Prove  that  {Aa  :  a  £IUJ} 
is  a  partition  of  E. 

Problem  11.10.  Let  X  be  a  nonempty  set  and  {Aa  :  a  £  /}  be  a  partition  of  X. 

(a)  Let  B  be  a  subset  ofX  such  that  Aa{3B  7^  0  for  every  a  £  I.  Is  {Aa  CiB  :  a  £  1} 
a  partition  of  B  !  Prove  it  or  give  a  counterexample. 

(b)  Suppose  further  that  Aa  7^  X  for  every  a  £  I.  Is  {X  \An  :  a  £  7}  a  partition  of 
XI  Prove  it  or  show  that  it  is  not  a  partition.  (Make  sure  you  consider  each  of 
the  following  cases:  the  partition  {Aa  :  a  £  1}  has  zero,  one,  two,  or  at  least 
three  elements.) 

Problem  11.11.  Recall  that  for  an  integer  n,  the  symbol  3|  n  means  that  there  exists 
m  £  Z  such  that  n  =  3m.  For  each  integer  i,  where  i  =  0,1,2,  we  define  the  set 
Aj  =  {x  £  Z  :  3|(x  —  t") }.  Show  that  {Ai :  i  =  0,1,2}  is  a  partition  of  Z. 

Problem  11.12.  Let  A  =  {x  £  M  :  x  >  0}  and  B  =  {jc  £  E  :  x  <  0}.  Describe  the 
equivalence  relation  on  E  that  is  associated  with  the  partition  {A,B}  of  E. 

Problem  11.13.  For  each  r  £  E  define  a  subset  of  E3  by 

Ar  =  {(x,y,z )  :y2  +  z2  =  r2}. 
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(a)  Prove  that  {A,- :  r  £  R}  is  a  partition  of  R3. 

(b)  Describe  the  equivalence  relation  associated  with  this  partition  geometrically. 

Problem  11.14.  Let  X  be  a  nonempty  set  with  an  equivalence  relation  ~  on  it.  Ac¬ 
cording  to  Theorem  1 1 .4,  this  equivalence  relation  gives  rise  to  a  partition  on  X 
which  in  turn  defines  a  new  equivalence  relation  ~  on  X.  Prove  that  ~  and  ~  are 
the  same  equivalence  relation  on  X. 

Problem  11.15.  Let  s2  be  a  partition  of  a  nonempty  set  X.  By  Theorem  11.4,  this 
partition  gives  rise  to  an  equivalence  relation  on  X  and  this  equivalence  relation,  in 
turn,  defines  a  new  partition  22  of  X.  Prove  that  s2  =  2$. 

Problem  11.16.  Let  X  =  {r£  Z+  :  x  <  100};  that  is,  X  is  the  set  of  all  integers  from 
1  to  100.  For  each  Y  £  22 (X)  we  define  Ay  =  { Z  £  22 (X)  :  Y  and  Z  have  the  same 
number  of  elements}. 

(a)  Prove  that  {Ay  :  Y  £  2^(X)}  partitions  22 {X). 

(b)  Let  ~  denote  the  equivalence  relation  on  22  (X)  that  is  associated  with  this 
partition  (according  to  Theorem  11.4).  If  possible,  find  A,  B,  and  C  such  that 

1.  A  ~  {1,2,3}  and  A  ^  {1,2,3} 

2.  B  /  {7,8,9} 

3.  C ~ X  and  C^X. 

Problem  11.17.  Let  the  collection  22  =  { Aa  :  a  £  7}  be  a  partition  of  a  nonempty 
set  A  and  22  =  {Bp  :  fi  £  7}  be  a  partition  of  a  nonempty  set  B.  We  define  the 
following  collection  of  sets  W  =  {Aa  x  Bp  :  a  £  I  and  /)£/}. 

(a)  Give  a  “small”  example  of  this  situation  using  A  =  B  =  R  and  two  different 
partitions  22  and  22 ,  each  having  exactly  two  sets.  (Specify  22 ,  22 ,  and  'W .) 

(b)  Does  the  collection  W  for  your  example  above  partition  the  plane  R  x  R?  (No 
proof  needed  for  this  part,  just  state  your  answer.) 

(c)  Prove  that  with  52 ,  22 ,  and  "/2  as  defined  at  the  beginning  of  this  problem,  W 
is  a  partition  of  Ax  B. 

Problem  11.18.  Let  X  be  a  nonempty  set  and  a2  a  partition  of  X.  Is  { 22  {A  ) :  A  £  .&/} 
a  partition  of  22{X)1  If  it  is,  prove  it.  If  it  isn’t,  give  a  simple  example  of  sets  X  and 
s2  such  that  {^(A)  :  A  £  £/}  is  not  a  partition  of  22 (X). 

Problem  11.19.  Consider  the  set  consisting  of  all  three-letter  “words”  that  you  can 
make  with  the  two  letters  O  and  T,  so  that 

S  =  {OOO,  TOO,  OTO,  OOT ,  OTT,  TOT ,  TTO,  TTT} . 


(a)  Define  an  equivalence  relation  on  the  set  S  of  “words”  by  saying  that  x  ~  y 
for  x,y  £  S  if  it  is  possible  to  rearrange  the  letters  in  x  to  obtain  y.  Show  that 
this  is  an  equivalence  relation  and  find  the  partition  s2  associated  with  ~. 
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(b)  Now  define  x  ~i  y  for  x,y  £  S  if  and  only  if  there  is  a  set  A  £  srf  such  that 
x,y  £  A.  Is  this  an  equivalence  relation?  If  so,  find  the  associated  equivalence 
classes. 

(c)  Is  your  answer  to  part  (b)  what  you  expected?  Why  or  why  not?  (If  a  theorem 
applies,  state  it.) 

Problem  11.20.  Do  there  exist  sets  X  for  which  g?(X)  is  a  partition  of  X?  If  so, 
give  an  example  of  such  a  set  X.  If  there  do  not  exist  such  sets,  prove  that  &(X)  is 
not  a  partition  of  X  for  all  sets  X. 

Problem  11.21.  For  k  £  Z,  we  define  =  {x  £  Z  :  x  =  5£  +  k  for  some  £  £Z}. 

(a)  Prove  that  { A :  k  £  Z}  partitions  Z. 

(b)  We  denote  by  ~  the  equivalence  relation  on  Z  that  is  obtained  from  the  par¬ 
tition  of  part  (a).  Give  as  simple  a  description  of  ~  as  possible;  that  is,  given 
two  elements  x  and  y  in  Z,  find  a  simple  condition  “C(x,y)”  on  x  and  y  so  that 
x  ~  y  if  and  only  if  “C(x,y)”  holds. 


Tips  on  Putting  It  All  Together 


In  my  own  writing,  I  average  about  five  pages  a  day.  Unfortunately,  they’re  all  the  same 
page. — Michael  Alley  [5,  p.  246] 

Now  we  will  build  upon  the  foundations  we  have  created. 

•  In  each  section,  work  through  the  definitions.  (Check  “Tips  on  Definitions.”)  If 
you  don’t  know  the  definitions,  you  cannot  get  started.  So  the  first  step  is  to  make 
sure  that  you  have  mastered  them. 

•  Next,  learn  and  understand  all  theorems.  You  don’t  have  to  memorize  their  num¬ 
ber,  of  course,  but  you  should  know  by  name  each  theorem  that  has  a  name.  Make 
sure  you  can  restate  every  theorem  in  the  text  correctly. 

•  If  you  are  asked  to  prove  something,  look  for  a  proof  or  theorem  that  reminds 
you  of  your  problem.  Read  it  over. 

•  If  your  problem  is  too  difficult,  try  a  simpler  one  first. 

•  Whenever  you  claim  something  is  true,  say  why  (at  least  to  yourself,  if  it  is  minor, 
and  to  the  reader,  if  it  is  major).  Is  it  a  definition?  a  theorem?  Are  the  techniques 
the  same  as  in  a  proof  everyone  has  already  seen?  If  you  are  writing  up  your 
homework,  tell  the  grader  which  theorem  (now  you  should  give  a  number)  or 
what  definition  you  are  using. 

•  If  you  can  check  your  solution,  do  so.  Is  your  answer  reasonable? 

•  Does  your  theorem  make  sense?  Does  it  agree  with  other  theorems  in  the  text? 
(It’s  supposed  to  agree,  of  course!)  Does  your  proof  use  everything  you  were 
given? 

•  Your  first  draft  is  precisely  that.  No  one  should  have  to  read  someone  else’s  first 
draft.  Work  out  the  solution,  write  it  up,  put  it  away,  read  it  again,  and  rewrite  it. 


Chapter  12 

Order  in  the  Reals 


You’ve  seen  numbers  ever  since  you’ve  been  in  school,  and  you  know  a  lot  about 
them.  It  is  possible  to  give  them  a  careful  mathematical  foundation.  In  fact,  it’s  pos¬ 
sible  to  construct  the  natural  numbers  (and  you  can  do  so  in  Project  29.3).  Then, 
if  you  try  to  introduce  operations  like  addition  and  subtraction,  you’ll  find  that  you 
are  missing  something:  the  negative  numbers.  So  you  look  at  the  integers,  and  try 
again.  Now,  trying  to  introduce  multiplication  and  division,  you’ll  find  you  are  miss¬ 
ing  something  again:  multiplicative  inverses.  So  you  look  at  the  rational  numbers, 
and  you’ll  find  you  are  missing  something  yet  again.  That  brings  you  to  the  real 
numbers.  Our  ultimate  goal  will  be  to  discuss  what’s  missing  in  Q,  and  to  show  you 
why  R  has  what’s  missing.  This  is  known  as  completeness  of  R.  In  this  and  the  next 
chapter,  we’ll  show  you  some  wonderful  applications  of  completeness. 

What  do  we  mean  by  this  property  “completeness,”  that  R  has,  but  Q  doesn’t?  If 
we  take  a  stroll  along  the  real  number  line,  we  can  walk  right  up  to  any  real  number 
and  it  will  be  there  waiting  for  us.  In  contrast  to  this,  if  we  walk  along  the  real 
line,  this  time  stepping  on  rational  numbers  only,  we  might  walk  right  up  to  where 
we’d  expect  \Jl  to  be,  but  it  will  be  out  having  lunch  with  other  irrationals.  We’ll 
make  this  precise  by  the  end  of  the  chapter.  We  remind  you  that  in  this  text  we  are 
assuming  that  the  real  numbers  satisfy  the  algebraic  and  order  properties  listed  in 
the  Appendix  on  pages  363-364.  We  now  turn  to  the  important  terminology  that  we 
need. 

A  nonempty  subset  A  of  R  is  bounded  above  if  there  is  a  real  number  M  such 
that  x  <  M  for  all  x  £  A.  We  call  a  real  number  M  satisfying  x  <  M  for  all  x  €  A 
an  upper  bound  of  A.  The  nonempty  subset  A  of  R  is  bounded  below  if  there  is 
a  real  number  m  such  that  m  <  x  for  all  x  £  A.  We  call  a  real  number  m  satisfying 
m  <  x  for  all  x  £  A  a  lower  bound  of  A.  We  say  a  nonempty  set  is  bounded  if  it  is 
bounded  above  and  below.  For  example,  the  open  interval  (0, 1)  is  bounded  above, 
since  every  x  £  (0,1)  satisfies  x  <  1.  The  number  1  is  an  upper  bound,  and  so  is  the 
real  number  1.5.  In  fact,  every  number  greater  than  or  equal  to  1  is  an  upper  bound. 
Similarly,  since  x  >  0  for  all  x  «E  (0, 1),  the  set  (0, 1)  is  bounded  below  and  0  is  an 
example  of  a  lower  bound  of  the  set.  Since  (0, 1)  is  bounded  above  and  below,  it  is 
an  example  of  a  bounded  set. 
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1 2  Order  in  the  Reals 


Exercise  12.1.  For  each  of  the  following  sets  of  real  numbers,  decide  whether  it  is 
bounded  above,  bounded  below,  and  (consequently)  whether  or  not  it  is  bounded.  If 
the  set  is  bounded  above,  give  three  different  examples  of  upper  bounds  in  R.  If  the 
set  is  bounded  below,  give  three  different  examples  of  lower  bounds  in  R.  Use  your 
intuition;  we’ll  prove  things  rigorously  later.  The  sets  are: 

(a)  {x  £  R  :  x2  <  5}; 

(b)  {.r€l:  x3  <  5}; 

(c)  jreN:r< 5}; 

(d)  {x  €  Q  :  x2  <  2}.  O 

Sometimes  a  subset  of  R  that  is  bounded  above  contains  a  largest  element,  and  we 
give  this  element  a  special  name:  a  maximum.  The  real  number  M  is  a  maximum  of 
the  set  A,  if  M  £  A  and  x<M  for  all  x  £  A.  We  will  write  M  =  max  A  for  a  maximum 
of  the  set  A.  Note  that  a  maximum  is  an  upper  bound  that  lies  in  the  set  A.  Likewise, 
a  real  number  m  is  a  minimum  of  the  set  A,  if  m  £  A  and  m  <  x  for  all  x  in  A.  We  will 
write  m  =  minA  to  denote  a  minimum  of  the  set  A.  Again,  notice  that  a  minimum  is 
a  lower  bound  that  lies  in  the  set  A. 

It  should  now  be  clear  that  if  a  set  A  has  a  maximum,  then  A  must  be  bounded 
above,  and  if  the  set  has  a  minimum,  then  A  must  be  bounded  below.  What  about 
the  converse? 

Example  12.2.  Give  an  example  of  a  bounded  set  that  has  neither  a  maximum  nor  a 
minimum. 

We  claim  that  the  set  (0,2)  is  bounded  and  has  neither  a  maximum  nor  a  mini¬ 
mum. 

Proof.  For  each  x  £  (0,2),  we  know  that  0  <  x  <  2.  Therefore  0  is  a  lower  bound 
of  the  set  and  2  is  an  upper  bound.  Thus,  (0,2)  is  bounded.  To  see  that  it  has  no 
maximum,  suppose  to  the  contrary  that  s  is  a  maximum  of  the  set  (0,2).  Then,  by 
definition  of  maximum,  s  must  be  in  the  set,  so  0  <  s  <  2.  But  (as  you  can  check) 
0  <  s  <  (2  +  s)/2  <2,  and  therefore  (2  +  s) /2  is  in  the  set  (0, 2)  and  larger  than  s,  a 
contradiction.  In  a  similar  fashion,  you  can  check  that  there  is  no  minimum.  □ 

It  turns  out  that  there  is  an  upper  bound  that  can  help  us  when  we  don’t  have 
a  maximum  (called  the  supremum),  and  a  lower  bound  that  can  help  us  when  we 
don’t  have  a  minimum  (called  the  infimum).  We  define  these  below. 

Let  A  be  a  nonempty  set  of  real  numbers  that  is  bounded  above.  Then  a  real 
number  U  is  said  to  be  a  supremum  of  A  or  least  upper  bound  of  A  if 

(i)  a  <U  for  all  a  €  A,  and 

(ii)  if  M  £  R  satisfies  a  <  M  for  all  a  SA,  then  U  <  M. 

Note  that  (i)  says  that  U  is  an  upper  bound,  while  (ii)  says  that  U  is  least  among 
all  upper  bounds.  While  the  phrase  “least  upper  bound”  is  more  descriptive,  most 
authors  prefer  the  term  “supremum.” 

The  following  lemma  tells  us  that  the  supremum,  when  it  exists,  is  unique. 
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Lemma  12.3.  If  a  nonempty  subset  of  R  has  a  supremum,  then  the  supremum  is 
unique. 

We  first  try  to  understand  the  problem.  Let’s  call  our  set  S.  We  are  not  asking 
whether  or  not  a  supremum  of  S  exists.  What  we  are  trying  to  do  is  to  show  that 
there  cannot  exist  two  different  real  numbers  a  and  b,  such  that  both  a  and  b  fulfill 
the  properties  of  supremum  of  5.  (We’ve  seen  examples  of  sets  with  more  than  one 
upper  bound.  Maybe  there  are  sets  with  more  than  one  least  upper  bound.) 

So  we  turn  to  devising  a  plan.  Let’s  suppose  that  there  are  two  such  numbers  a 
and  b,  and  try  to  show  that  they  must  be  equal. 

Proof  Let  S  be  a  nonempty  subset  of  R.  Suppose  a  and  b  are  two  real  numbers 
that  satisfy  properties  (i)  and  (ii)  in  the  definition  of  supremum.  Then  a  is  an  upper 
bound.  Since  b  is  a  supremum,  property  (ii)  implies  that  b  <  a.  On  the  other  hand, 
since  b  is  an  upper  bound  and  a  is  a  supremum,  property  (ii)  implies  that  a  <  b. 
Thus  a  =  b,  and  we  conclude  that  there  is  at  most  one  supremum.  □ 

From  here  on  in,  we  will  refer  to  “the”  supremum,  and  we  will  denote  the  supre¬ 
mum  of  a  nonempty  set  A  by  sup  A. 

The  last  proof  was  your  first  proof  of  uniqueness.  This  particular  proof  is  fairly 
standard.  You’ll  frequently  be  able  to  prove  uniqueness  by  supposing  that  you  have 
two  such  objects,  and  showing  that  they  must  be  equal. 

Exercise  12.4.  Return  to  Exercise  12.1.  Use  your  intuition  to  decide  which  of  the 
sets  have  a  supremum  in  R.  For  the  sets  that  you  decide  have  a  supremum,  find  a  real 
number  that  you  believe  is  the  supremum.  At  this  point,  you  may  use  your  intuition 
to  find  an  answer.  We  will  ask  you  to  prove  that  your  answer  is  correct  later.  O 

So  we  have  defined  two  notions,  supremum  and  maximum.  What  are  the  differ¬ 
ences  and  what  are  the  similarities?  A  close  look  at  the  definition  shows  that  the 
maximum  of  a  set  A  must  be  in  A,  while  the  supremum  of  A  need  not.  On  the  other 
hand,  we  also  have  the  following. 

Exercise  12.5.  Let  A  be  a  nonempty  subset  of  real  numbers  that  is  bounded  above. 
Show  that  if  A  has  a  maximum  M,  then  M  is  the  supremum  of  A.  Conclude  that  a 
maximum  of  a  set  is  unique  and  thus  we  can  speak  of  “the”  maximum  of  a  set  A — if 
it  exists.  O 

In  the  following  exercise  you  will  define  and  investigate  the  infimum  (or  greatest 
lower  bound)  of  a  set.  This  is  an  important  exercise,  and  we  will  refer  to  it  frequently. 

Exercise  12.6.  Let  A  be  a  nonempty  subset  of  R  that  is  bounded  below. 

(a)  Define  infimum  (or  greatest  lower  bound)  of  the  set  A. 

(b)  Do  what  you  always  do  when  confronted  with  a  new  definition:  Find  exam¬ 
ples  and  nonexamples.  Q 
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The  infimum  of  a  set  A,  denoted  inf  A,  is  also  unique  (Problem  12.17)  and  hence 
we  can  speak  of  “the”  infimum.  If  a  set  A  has  a  minimum,  then  this  minimum  is  the 
infimum  of  the  set.  Thus,  minA  is  also  unique,  if  it  exists. 

Some  students  find  the  words  supremum  and  infimum  difficult  to  remember.  But 
once  you  get  used  to  it,  these  words  will  sound  like  what  they  are:  If  the  supremum 
is  in  the  set,  it’s  the  maximum.  If  it’s  not  in  the  set,  the  supremum  (as  suggested 
by  the  word  “superior”)  lies  above  the  set.  Similarly,  if  the  infimum  is  in  the  set, 
it’s  the  minimum.  If  it’s  not  in  the  set,  then  the  infimum  (as  suggested  by  the  word 
“inferior”)  lies  below  the  set. 

The  next  example  shows  how  to  prove  rigorously  that  a  particular  number  is  the 
infimum  (or  supremum)  of  a  set.  Remember  that  to  show  l  is  the  infimum,  we  must 
show  that  it  is  a  lower  bound,  and  that  if  y  is  another  lower  bound,  then  y  <  t.  We 
will  actually  show  the  contrapositive  of  this  last  assertion:  if  y>£,  then  y  is  not  a 
lower  bound. 

Example  12.7.  Show  that  inf(3,4]  =  3. 

Proof.  We  note  that  3  <  x  for  all  x  G  (3,4].  Therefore,  3  is  a  lower  bound. 

To  see  that  it  is  the  infimum,  we  will  show  that  nothing  larger  can  be  a  lower 
bound.  To  this  end,  let  y  be  chosen  so  that  3  <  y.  if  y  >  4,  then  y  is  not  a  lower  bound 
of  (3,4].  If  y  <  4,  then  (3  +y) /2  is  a  real  number  such  that  3  <  (3  +y)/2  <  y  <  4. 
Therefore  (3+y) /2  G  (3,4]  and  (3+y)/2  <  y.  Thus,  y  is  not  a  lower  bound  of  (3,4]. 
Hence,  3  is  the  infimum  of  (3,4].  □ 

There  is  one  point  in  the  proof  above  that  is  very  important  and,  unfortunately, 
very  easy  to  overlook.  When  we  checked  that  “nothing  larger  than  3  can  be  a  lower 
bound,”  we  chose  y  >  3  and  showed  that  (3  +y) /2  <  y.  This,  alone,  will  not  convince 
someone  that  y  is  not  a  lower  bound  of  (3,4];  if  (3  +y) / 2  is  not  in  the  set  (3,4],  it 
won’t  help  us  at  all.  That’s  why  we  also  checked  that  (3+y) /2  G  (3,4].  Our  point 
is  this:  to  show  yo  is  not  a  lower  bound  of  a  set  S,  you  must  find  something  in  the 
set  S  that  is  smaller  than  yo.  (Keep  this  in  mind  when  you  solve  the  problems;  in 
particular,  when  you  solve  Problem  12.1.) 

Example  12.7  provides  us  with  an  example  of  a  bounded  set  that  has  no  mini¬ 
mum,  but  does  have  an  infimum. 

Exercise  12.8.  Return  to  Exercise  12.4  and  assume  for  now  that  the  real  numbers 
■s/5  and  5 1/3  are  irrational.  (We  will  discuss  this  assumption  in  the  next  chapter.)  If 
the  set  has  an  infimum,  say  what  you  think  the  infimum  is.  Then  give  an  argument 
that  shows  that  your  answers  for  supremum  (from  Exercise  12.4)  and  infimum  for 
the  first  three  sets  are  correct. 

We  saw  in  Example  12.2  that  there  are  sets  that  are  bounded,  but  have  no  max¬ 
imum  or  minimum.  Since  a  maximum  is  a  supremum,  it  may  seem  that  there  exist 
sets  that  are  bounded  above  but  have  no  supremum  in  R.  It  turns  out  that  in  R  this 
is  not  the  case;  the  real  numbers  are  constructed  to  guarantee  the  existence  of  a 
supremum  of  every  bounded  nonempty  set.  This  will  not  be  proved;  in  a  way  it  is 
an  agreement.  The  technical  term  for  such  a  statement  is  axiom. 
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The  completeness  axiom  of  the  reals.  Every  nonempty  subset  of  real  numbers  that 
is  bounded  above  has  a  supremum. 

Exercise  12.9.  State  a  version  of  the  completeness  axiom  of  R  replacing  the  word 
“supremum”  by  the  word  “infimum.”  What  conditions,  if  any,  must  be  placed  on  the 
set?  Prove  that  the  two  versions  are  equivalent.  (Once  you  have  your  version  of  the 
completeness  axiom,  you  may  use  Problem  12.7  to  complete  this  exercise.)  O 

Here’s  an  extremely  useful  consequence  of  the  work  we  have  built  up  in  this 
chapter: 

Theorem  12.10  (Archimedean  property  of  R).  Let  a  and  b  be  two  positive  real 
numbers.  Then  there  exists  a  positive  integer  n  such  that  a  <  nb. 

Proof.  Suppose  that  this  is  not  true;  that  is,  suppose  that  there  are  two  positive  real 
numbers,  a  and  b,  such  that  a  >  nb  for  all  n  £  N.  This  means  that  a  jb  is  an  upper 
bound  of  N.  Therefore,  by  the  completeness  axiom  of  R,  it  follows  that  N  has  a 
supremum,  which  we  will  call  u.  Now  consider  u  —  1.  Since  this  is  less  than  the 
supremum  u,  it  can’t  be  an  upper  bound  of  N.  So  there  exists  m  €  N  with  m  >  u  —  1 . 
Therefore,  m+leN  and  in  +  1  >  u.  Since  no  element  of  N  can  be  greater  than  the 
upper  bound  u,  this  is  a  contradiction.  □ 

When  we  have  a  result  that  follows  from  a  theorem  that  we  just  proved,  we  call  it 
a  corollary.  You  will  show,  in  Problem  12. 14,  that  the  following  is  indeed  a  corollary 
of  the  Archimedean  property. 

Corollary  12.11.  For  every  real  number  a,  there  is  an  integer  n  such  that  a  <  n. 

We  now  turn  to  the  well-ordering  principle  of  N,  which  is  concerned  with  a  fun¬ 
damental  property  of  the  natural  numbers.  There  is  another  important  principle, 
called  the  principle  of  mathematical  induction,  which  we  will  introduce  in  Chap¬ 
ter  18  and  Project  29.3.  If  you  work  the  project,  you  will  learn  that  induction  is  one 
of  the  five  Peano  axioms  that  can  be  used  to  construct  the  natural  numbers.  For  now, 
we  will  state  and  use  the  well-ordering  principle  without  proof.  In  Chapter  18,  we 
will  show  that  the  well-ordering  principle  of  N  and  the  principle  of  mathematical 
induction  are  equivalent. 

Well-ordering  principle  of  the  natural  numbers.  Every  nonempty  subset  of  the  nat¬ 
ural  numbers  contains  a  minimum. 

As  a  consequence  of  the  well-ordering  principle,  we  obtain  an  interesting  theo¬ 
rem  about  where  the  rationals  “live.”  The  next  theorem  suggests  that  they  can  really 
fill  up  space!  The  curious  thing  about  them,  which  we  will  return  to  in  Chapter  23, 
is  that  there  really  aren’t  that  many  of  them. 

Theorem  12.12.  Let  a  and  b  be  two  real  numbers  satisfying  a  <  b.  Then  there  is  a 
rational  number  c  such  that  a  <  c  <  b. 
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The  proof  of  this  result  will  be  much  easier  to  follow  if  you  understand  the  basic 
idea.  It’s  this:  if  the  difference  between  a  and  b  were  greater  than  one,  then  there 
would  have  to  be  an  integer  m  with  a  <  m  <  b  and  we  would  be  done.  Of  course, 
the  difference  does  not  have  to  be  greater  than  one,  but  we  can  sort  of  force  it  to  be: 
Look  at  b  —  a  and  multiply  by  an  integer  n  so  that  n(b  —  a)  >  1 .  Now  the  difference 
between  nb  and  na  is  greater  than  one,  so  there  has  to  be  an  integer  m  between 
them  (but  this  needs  proof).  So  we  will  prove  that  there  exists  an  integer  m  with 
na  <  m  <  nb.  Divide  by  n  to  obtain  the  desired  rational  number,  m/n. 

Proof.  As  you  will  show  in  Problem  12.21,  we  may  assume  without  loss  of  gen¬ 
erality  that  a  >  0.  By  Theorem  12.10  there  is  an  integer  n  such  that  n(b  —a)  >  1. 
Thus, 

nb  >  1  +na.  (12.1) 

Now  consider  the  subset  A  of  N  defined  by  A  =  {r  £  N  :  na  <  r}.  By  Corollary 
12.11,  A  is  nonempty.  The  well-ordering  principle  implies  that  A  has  a  minimum, 
which  we  call  m.  Thus  m  €  A,  and  from  the  definition  of  A  we  see  that  na  <  m\  in 
other  words,  a  <  m/n.  Let  c  be  the  rational  number  m/n.  Then  we  have  the  lower 
inequality,  a<c,  and  we  are  halfway  there.  For  the  upper  inequality,  note  that  in  —  1 
is  not  in  the  set  A  (what  would  happen  if  it  were?)  so  na  >m  —  1.  So,  putting  this 
together  with  equation  12.1  we  get 

nb  >  1  +  na  >  1  +  [m  —  1 ) . 

So  nb  >  m,  and  b  >  m/n.  Now  c  =  m/n  is  a  rational  number  between  a  and  b,  and 
this  completes  the  proof.  □ 


Definitions 

Definition  12.1.  A  nonempty  subset  A  of  M  is  bounded  above,  if  there  is  a  real 
number  M  such  that  x  <  M  for  all  i€A.  We  say  a  nonempty  subset  A  of  R  is 
bounded  below,  if  there  is  a  real  number  m  such  that  m  <  x  for  all  x  £  A. 

Definition  12.2.  A  nonempty  subset  of  R  is  bounded  if  it  is  bounded  above  and 
bounded  below. 

Definition  12.3.  Let  A  be  a  nonempty  subset  of  R.  A  real  number  M  that  satisfies 
x  <  M  for  all  x  £  A  is  called  an  upper  bound  of  A.  We  call  a  real  number  m  that 
satisfies  m  <  x  for  all  x  €  A  a  lower  bound  of  A. 

Definition  12.4.  The  real  number  M  is  the  maximum  of  the  subset  A  of  R,  written 
M  =  maxA ,  if  M  £  A  and  x  <  M  for  all  x  £  A. 

Definition  12.5.  The  real  number  m  is  the  minimum  of  the  subset  A  of  R,  written 
m  =  min  A,  if  m  £  A  and  m  <  x  for  all  x  in  A. 
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Definition  12.6.  Let  A  be  a  nonempty  set  of  real  numbers  that  is  bounded  above. 
Then  a  real  number  U  is  said  to  be  the  supremum  of  A  or  least  upper  bound  of  A, 
written  U  =  sup, 4.  if 

(i)  a  <  [/  for  all  a  £  A,  and 

(ii)  if  M  £  ffi.  satisfies  a<M  for  all  a  £A,  then  U  <M. 

Definition  12.7.  Let  A  be  a  nonempty  set  of  real  numbers  that  is  bounded  below. 
Then  a  real  number  t  is  said  to  be  the  infimum  of  A  or  greatest  lower  bound  of  A, 
written  l  =  infA,  if 

(i)  £<b  for  all  b  G  A,  and 

(ii)  if  m  £  K.  satisfies  m  <  b  for  all  b  £A,  then  m  <  £. 


Solutions  to  Exercises 

Solution  (12.1).  We  include  brief  answers  to  each  part  here. 

(a)  This  set  is  bounded,  and  therefore  bounded  above  and  below.  Some  possible 
upper  bounds  are  V 5,  3,  and  121 .  Some  possible  lower  bounds  are  —a/5,  —  10, 
and  —2  K. 

(b)  This  set  is  bounded  above,  and  it  is  not  bounded  below.  Some  possible  upper 
bounds  are  51/3,  10,  and  21.3. 

(c)  This  set  is  bounded,  and  therefore  bounded  above  and  below.  Some  possible 
upper  bounds  are  5,  121,  and  1000.  Some  possible  lower  bounds  are  0,  —3, 
and  —12. 

(d)  This  set  is  bounded  above  and  below,  and  therefore  bounded.  Every  real  num¬ 
ber  greater  than  or  equal  to  y/2  will  work  as  an  upper  bound,  and  every  real 
number  less  than  or  equal  to  —  V2  will  work  as  a  lower  bound. 

Solution  (12.4).  Our  intuition  tells  us  that  we  have  sup{x  £  R  :  x2  <  5}  =  y/5, 
sup{x  £  R  :  x3  <  5}  =  51/3,  sup{x  £  N :  x  <  5}  =  5,  and  sup{x  £  Q  :  x2  <  2}  =  y/2. 

Solution  (12.5).  Let  M  =  maxA.  Then  a  <  M  for  all  a  £  A  and  property  (i)  of  the 
definition  of  supremum  is  fulfilled.  Now  suppose  that  K  is  a  real  number  satisfying 
a<K  for  all  a£A.  Since  M  is  in  A  we  have  in  particular  that  M  <K.  Thus,  property 
(ii)  holds  and  M  =  sup  A. 

By  Lemma  12.3,  the  supremum  of  A  is  unique.  Since  maxA  is  the  supremum  of 
A,  it  is  also  unique. 

Solution  (12.6). 

(a)  Let  A  be  a  nonempty  set  of  real  numbers  that  is  bounded  below.  A  real  number 
m  is  the  infimum  (or  greatest  lower  bound)  of  A  if 

(i)  a  >  m  for  all  a£A,  and 

(ii)  if  y  is  a  real  number  satisfying  a  >  y  for  all  a  £  A,  then  m  >  y. 
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(b)  Let  A  =  {x£  R  :  x  >  10}.  Then  A  is  bounded  below  (e.g.,  by  0),  nonempty 
(e.g.,  20  £  A),  and  inL4  =  10.  This  can  be  seen  as  follows. 

(i)  a  >  10  for  all  a  £  A  by  the  definition  of  A. 

(ii)  If  y  is  a  real  number  satisfying  a  >  y  for  all  a  £  A  and  y  >  10,  then 
10  <  (y  +  10)  /2  <  y.  Thus  (y  +  10)  /2  £  A  and  (y  +  10)  /2  y.  This  is 
a  contradiction  and  it  follows  that  y  <  10. 

These  two  conditions  imply  that  inf  A  =  10. 

Using  the  set  A  above,  0  -/=  infA  and  20  f  infA,  giving  two  nonexamples. 

We  do  not  seem  to  be  able  to  find  a  nonempty  set  that  is  bounded  below  and 
does  not  have  an  infimum.  That  no  such  set  exists  will  follow  from  Exer¬ 
cise  12.9  below. 

Solution  (12.8). 

In  this  problem  we  assume  that  there  is  a  real  number,  x,  with  the  property  that 
x2  =  5.  Similarly,  we  assume  that  there  is  a  real  number  y  with  the  property  that 
y3  =  5.  A  rigorous  argument  for  the  existence  of  x  will  be  given  in  Theorem  13.2. 

First,  let  S  =  {x  £  R  :  x2  <  5}.  We  claim  that  m  =  —V5  =  infS.  If  x  £  S,  then 
x2  <  5  =  m 2.  Thus,  |x|  <  \m\.  Since  m  <  0,  we  have  m  <  |x|  <  —m.  So  we  conclude 
that  m  <  x.  Thus  m  is  a  lower  bound  of  5.  If  v  is  any  other  lower  bound  of  S,  then 
v  <  —  s/5  because  —  \/5  £  S.  Thus  v2  >  5  =  m2  and  hence  v  <  m.  This  establishes 
the  claim.  (This  illustrates  Exercise  12.5.  We  could  also  have  used  the  result  of  this 
exercise  and  we  will,  from  here  on.)  An  entirely  similar  proof  shows  that  sup  S  =  y/5. 

For  the  second  set,  T  =  {x  £  R:x3<  5},  note  that  T  is  not  bounded  below.  Hence 
it  has  no  infimum.  In  Exercise  12.4  we  claimed  that  supT  =  51/3.  Set  M  =  51/3.  If 
x  £  T,  then  x3  <  5  =  M3.  Hence  x<M,  showing  that  M  is  an  upper  bound.  Suppose 
that  U  is  an  upper  bound  of  T  and  that  U  <  M.  Then  U  <  (U  +M)  /2  <  M.  Thus 
(U  +  M)/ 2)3  <  A/3  =  5.  Hence  (U  +  M)/2  £  T .  This  shows  that  U  cannot  be  an 
upper  bound  and  implies  that  M  =  sup  7’. 

Finally,  the  third  set  is  simply  W  =  {0, 1,2, 3, 4, 5}.  We  see  that  0  is  the  minimum 
and  5  is  the  maximum.  From  Exercise  12.5,  we  conclude  that  the  infimum  is  0  and 
the  supremum  is  5. 

We  note  here  that  the  fourth  set  is  more  complicated  because  of  the  requirement 
that  x  be  a  member  of  Q. 

Solution  (12.9).  Completeness  axiom  of  R;  infimum  version.  Every  nonempty 
subset  of  real  numbers  that  is  bounded  below  has  an  infimum. 

To  show  the  equivalence  of  the  two  versions  we  first  assume  that  every  nonempty 
set  of  real  numbers  that  is  bounded  above  has  a  supremum.  Let  S  be  a  nonempty 
subset  of  the  reals  that  is  bounded  below.  Define  T  =  {x  £  R  :  — x  £  S}.  By  the 
result  of  Problem  12.7,  we  conclude  that  T  is  bounded  above.  By  the  completeness 
axiom  of  R,  the  set  T  has  a  supremum.  Again  using  the  result  of  Problem  12.7,  we 
conclude  that  the  set  S  has  an  infimum.  This  establishes  the  infimum  version  of  the 
axiom. 

A  similar  proof  establishes  the  fact  that  the  infimum  version  of  the  completeness 
axiom  implies  the  supremum  version. 
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Problems 

Problem  12.1.  A  student  solved  the  following  problem:  Let  S  =  Q  IT  (0,4).  Show 
that  sup. S'  =  4.  Below  is  the  student’s  solution.  Criticize  it. 

Not  a  proof  (Student  Solution).  First  we  show  that  4  is  an  upper  bound.  Let  x  £  S. 
Then  0  <  x  <  4,  so  4  is  clearly  an  upper  bound.  Suppose  to  the  contrary  that  4  is  not 
the  supremum.  Then  there  exists  an  upper  bound  u  with  u  <4.  But  u  <  (u  +  4)  /2, 
and  we  have  shown  that  u  is  not  an  upper  bound  of  S.  This  shows  that  4  must  be  the 
supremum.  0 

Problem  12.2.  Consider  the  sets  below.  For  each  one,  decide  whether  the  set  is 
bounded  above.  If  it  is,  give  the  supremum  in  K.  Then  decide  whether  or  not  the 
set  is  bounded  below.  If  it  is,  give  the  infimum.  Finally,  decide  whether  or  not  the 
supremum  is  a  maximum,  and  whether  or  not  the  infimum  is  a  minimum: 

(a)  The  closed  interval  [0, 4] ; 

(b)  The  open  interval  (0,4); 

(c)  The  natural  numbers  N; 

(d)  The  set  [0,^2]  HQ. 

Problem  12.3.  Consider  the  interval  (1,4)  in  R.  Show  in  detail 

(a)  that  4  is  the  supremum,  and 

(b)  that  1. 1  is  not  a  lower  bound. 

Problem  12.4.  Let  X  =  ^F(N)  \  {0}.  For  R,S  £  X  we  define  R  ~  S  if  and  only  if 
min/?  =  min^. 

(a)  Give  an  example  of  two  different  elements  of  X  that  are  related  to  each  other 
with  respect  to  the  relation  ~  defined  above. 

(b)  Prove  that  ~  as  defined  above  is  an  equivalence  relation  on  X. 

Problem  12.5.  Show  that  inf{  1  /n  :  n  £  Z+}  =  0. 

Problem  12.6.  Show  that  sup{l  —  1  /n:n  £  Z+}  =  1. 

Problem#  12.7.  Let  SCK  and  define  T  =  {r  £  R  :  —  x  £  S}. 

(a)  Prove  that  if  S  is  nonempty  and  bounded  below,  then  T  is  bounded  above. 
Further,  prove  that  if  sup  7’  exists,  then  infS  exists  and  inf. S'  =  —  sup  7’. 

(b)  Prove  that  if  S  is  nonempty  and  bounded  above,  then  T  is  bounded  below. 
Further,  prove  that  if  i ill  7'  exists,  then  sup. S’  exists  and  sup. S'  =  —  intT. 

Problem*  12.8.  Let  S  be  a  nonempty  subset  of  R.  Prove  that  S  is  bounded  if  and 
only  if  there  exists  MeR  such  that  \x\<M  for  all  x  £  S. 

Problem*  12.9.  Let  S  be  a  nonempty  bounded  subset  of  R.  Show  that  inf  S  <  sup  S. 
Under  what  conditions  on  S  would  you  have  inf 5  =  sup. S’? 
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Problem  12.10.  Let  S  be  a  nonempty  bounded  subset  of  R  and  let  u  be  a  real  number 
such  that  m  <  sup. S'.  Show  that  there  exists  sgS  such  that  u  <  s. 

Problem  12.11.  Let  S  and  T  be  nonempty  bounded  subsets  of  R. 

(a)  Show  that  sup(Sur)  >  sup S,  and  sup(Sur)  >  sup7\ 

(b)  Show  that  sup(Sur)  =  maxjsupSjSupr}. 

(c)  Try  to  state  the  results  of  (a)  and  (b)  in  English,  without  using  mathematical 
symbols. 

Problem  12.12.  Let  reK  and  let  S  be  a  nonempty  subset  of  R  that  is  bounded 
above.  We  define  a  new  set,  x  +  S,  byx  +  S  =  {x  +  s:sS5'}. 

(a)  Prove  that  x  +  S  is  bounded  above. 

(b)  Prove  that  x+  sup. S'  is  an  upper  bound  of  x  +  S.  Using  this  result,  conclude 
that  sup(x  +  S)  <x  +  supS. 

(c)  Prove  that  x  +  sup S  =  sup(x  +  S) . 

Problem  12.13.  Let  A  and  B  denote  nonempty  bounded  subsets  of  R  and  define 

A(BB  =  {x  +  y  :  x  £  A,y  £  B}. 

(a)  Prove  that  A  ©  B  is  also  a  nonempty  bounded  subset  of  R. 

(b)  LetA  =  [1,3]  andfi  =  [2,4].  Find  A©B,  supA,  supB,  and  sup(A©B). 

(c)  Show  that  sup(A ©B)  <  supA  +  supB. 

(d)  Show  that  supA  +  supB  <  sup(A©B). 

Problem*  12.14.  Prove  Corollary  12.11. 

Problem  12.15.  Find  the  supremum  and  infimum  of  the  set  (0,2)  ITQ  and  justify 
your  answer.  (If  you  haven’t  worked  Problem  12.1,  you  might  want  to  work  it  before 
solving  this  problem.) 

Problem  12.16.  Let  e  be  a  positive  real  number.  Prove  that  for  every  real  number  a, 
there  exists  a  rational  number  b  (depending  on  a)  such  that  \a  —  b\  <  £. 

Problem*  12.17.  Prove  that  if  a  subset  A  of  the  reals  has  an  infimum,  then  the  infi¬ 
mum  is  unique. 

Problem  12.18.  Prove  that  every  nonempty  bounded  subset  of  N  has  a  maximum. 

Problem  12.19.  Show  that  there  does  not  exist  a  real  number  b  such  that  a  <b  for 

all  a  €  R. 

Problem  12.20.  Let  a  £  Q,  a  ^  0,  and  b  £  l\Q.  Prove  the  following: 

(a)  a  +  b£  R\Q; 

(b)  ab£R\Q- 

(c)  l/b£R\Q. 
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Problem#  12.21.  Suppose  that  we  have  established  Theorem  12.10  for  the  case 
when  a  >  0;  that  is,  suppose  we  know  that  if  a  and  b  are  real  numbers  and  0  <  a  <  b, 
then  there  exists  a  rational  number  r  with  a  <r  <b.  Show  that  it  follows  that  for  all 
real  numbers  x  and  y  with  x  <  y,  there  is  a  rational  number  s  with  x  <  ,v  <  v.  (This 
is  what  we  meant  by  “we  may  assume  without  loss  of  generality  that  a  >  0.”  We 
suggest  you  add  an  appropriate  real  number  M  to  x  and  y  to  reduce  your  problem  to 
the  case  0  <  a  <  b.) 


Chapter  13 

Consequences  of  the  Completeness  of  M 


We  have  now  reached  the  point  at  which  we  can  give  rigorous  proofs  of  two  facts  you 
have  known  for  some  time.  We  mention  the  second  first;  that  is,  we  will  conclude 
this  chapter  by  showing  that  there  is  a  nonempty  bounded  subset  of  Q  that  does  not 
have  a  supremum  in  Q.  As  an  exercise,  you  will  show  that  there  is  also  a  nonempty 
bounded  subset  of  Q  that  does  not  have  an  infimum  in  Q.  In  this  sense,  Q  is  not 
complete.  Since  R  is  complete,  there  must  be  numbers  in  M  that  are  not  in  Q.  We 
know  what  these  numbers  are,  of  course;  they  are  the  irrational  numbers.  Thus  far 
we  haven’t  proven  that  a  particular  real  number  is  irrational.  But  now  we  can!  This 
will  be  the  first  fact  that  we  prove.  Here’s  a  rough  outline:  We  will  show  that  if  a  is 
a  positive  real  number,  then  a  has  a  positive  square  root;  that  is,  there  exists  x  £  R+ 
such  that  x2  =  a.  But  Theorem  5.2  told  us  that  v 2  is  not  rational.  Thus,  we  know 
V2  exists  and  we  know  it  is  not  rational.  Therefore,  we  have  a  rigorous  proof  that 
V2  is  irrational. 

This  might  be  a  good  time  to  put  in  a  plug  for  a  project:  It’s  much  harder  than 
you  might  think  to  prove  that  a  particular  number  is  irrational.  We  take  for  granted 
that  e  and  n  are  irrational,  but  proving  this  is  really  not  so  easy.  You  can  do  so  if  you 
work  Project  29.5. 

Before  we  get  to  the  heart  of  this  chapter,  we  will  get  a  little  more  practice  with 
least  and  greatest  lower  bound.  You  should  solve  the  exercise  below  before  proceed¬ 
ing  to  the  next  theorem. 

Exercise  13.1.  Let  a  =  supjw  £  R+  :  w2  <  2}.  Show  that  if  0  <  c  <  a,  then  c2  <  2. 

o 

Theorem  13.2  (Existence  of  square  roots  in  R+).  There  exists  a  positive  real  num¬ 
ber  a  such  that  a2  =  2. 

In  order  to  help  you  better  understand  this  proof,  we’ll  indicate  how  we  “devised 
a  plan.”  The  basic  idea  is  that  the  square  root  should  be  the  supremum  of  the  set 
A  =  {w  £  R+  :  vv2  <  2}.  The  completeness  axiom  tells  us  this  supremum  exists,  so 
we’ll  call  it  a.  How  do  we  show  that  a  is  the  square  root  of  2?  Well,  we  need  to 
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show  that  a 2  =  2.  If  a 2  >  2,  our  intuition  tells  us  that  we  should  be  able  to  subtract 
something  off  of  a,  which  we’ll  call  x,  and  come  up  with  something  smaller  than  a 
that  is  still  an  upper  bound  of  the  set.  Thus  a—x  would  be  an  upper  bound  smaller 
than  the  supremum,  and  this  would  contradict  the  fact  that  a  is  the  least  upper  bound. 
So  we  now  have  to  worry  about  the  case  in  which  a2  <2.  In  this  case,  our  intuition 
tells  us  that  we  should  be  able  to  add  just  a  little  bit  to  a,  which  we’ll  call  y,  and 
find  an  element  of  A,  namely,  a+y,  that  is  bigger  than  the  upper  bound  a.  This  now 
contradicts  the  fact  that  a  is  an  upper  bound.  So,  since  neither  of  these  two  cases  can 
occur,  the  only  other  possibility,  a2  =  2,  must  be  the  one  that  holds. 

Proof.  Let  A  =  {w  £  R+  :  w2  <  2}.  Then  A  is  a  nonempty  subset  of  R  (since  1  £  A), 
and  A  is  bounded  above  (by,  for  example,  2).  By  the  completeness  axiom  A  has  a 
supremum,  which  we  denote  by  a.  What  we  know  so  far  is  that  a  is  a  real  number, 
and  1  <  a  <  2.  We  will  show  that  a2  =  2. 

We  know  that  one  of  the  following  three  cases  must  occur:  cr  >  2,  cr  <  2,  or 
a 2  =  2.  We’ll  show  that  the  first  two  cases  are  impossible. 

Case  1.  Suppose  that  cr  >  2.  Let  x  =  (a2  —  2)/ (2a).  (We’ll  explain  the  choice  of 
this  x  after  the  proof.)  Then,  as  you  can  check,  0  <  a  —  x  <  a.  By  Exercise  13.1  we 
know  that  (a—x)2  <2.  But 

(a—x)2  =  a2  —  2a  +x2  >  a2  —  2 ax  =  2. 

So  (a  —  x)2  <  2  and  (a  —  x)2  >  2.  This  is  impossible  and  we  conclude  that  Case  1 
cannot  occur. 

Case  2.  Suppose  that  a 2  <  2.  Let  y  =  (2  —  a2) / (3a).  (You’ll  explain  the  choice  of 
this  y  in  Problem  13.10,  after  you  have  read  our  explanation  for  the  choice  of  x  for 
Case  1.)  Then  y  >  0,  and  therefore  a  +  y  >  a.  Since  a  is  an  upper  bound  of  A,  we 
see  that  a+y  ^  A.  Thus  (a+y)2  >  2.  We  will  show  that y  <  a,  and  then  use  this  to 
obtain  our  contradiction.  To  this  end,  note  that  because  1  <  a  we  have 

a  —  y=  (4a2  —  2)/(3 a)  >  0. 

Thus,  y  <  a,  as  claimed.  Since  y  <  a.  we  also  know  that  y2  <  ay.  Thus, 

(a+y)2  =  a2  +2ay  +y2  <  a2  +2ay  +  ay  =  a2  +  3 ay  =  2. 

So,  (a+y)2  >  2  and  (a+y)2  <  2.  This  is  impossible,  and  we  conclude  that  Case  2 
cannot  occur. 

Thus  we  conclude  that  the  only  remaining  possibility  holds,  and  therefore  a2  =  2. 

□ 

We  promised  an  explanation  for  our  choice  of  x.  Here  it  is:  We  wanted  x  to  satisfy 
two  conditions,  0  <  x  <  a  and  (a  -x)2>  2.  The  second  condition  is  difficult  to  solve 
(without  resorting  to  \/T).  Let’s  concentrate  on  (a—x)2  >  2.  Once  we  know  which 
x  satisfy  this  inequality,  we’ll  worry  about  satisfying  0  <  x  <  a.  So  we  want  to 
solve  (a—x)2  >  2,  and  we  will  do  so  by  following  Polya’s  advice  about  simplifying 
the  problem.  We  note  that  (a  —  x)2  =  a2  —  2 ax  +  x2  >  a2  —  2 ax.  Now,  if  we  make 
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a 2  —  lax  =  2,  then  we  have  also  made  {a  —  x)2  >  2.  So,  now  we  have  arrived  at  an 
easier  problem:  Find  x  such  that  a 2  —  lax  =  2.  Solving  this  we  get  x  =  (a1  —  2) /2a. 
This  will  be  our  first  guess  for  x.  If  it  doesn’t  satisfy  0  <x  <  a,  we’ll  need  to  adjust 
it.  But  we  know  that  l<a<2sox<2,  and  we  suppose  that  a2  >  2  so  0  <  x.  Thus, 
0  <  x  <  2,  as  desired.  The  proof  shows  that,  in  fact,  this  choice  does  everything  it 
needs  to  do. 

A  similar  proof  can  be  used  to  show  that  for  every  x  >  0  and  every  positive  integer 
n ,  there  exists  a  unique  real  number  a  >  0  such  that  a"  =  x.  Uniqueness  follows 
easily  from  the  fact  that  if  a,b  >  0  and  a  <  b,  then  an  <  b".  (See  [92,  p.  10]  for  a 
proof  of  the  existence  and  uniqueness  of  nth  roots.)  This  real  number  is  sometimes 
called  the  principal  nth  root  of  x.  From  here  on,  we  will  assume  the  existence  of  nth 
roots  and  we  will  use  the  usual  rules  of  exponentiation. 

The  completeness  axiom  of  R  says  that  if  we  start  with  a  nonempty  set  that  is 
bounded  above,  we  can  find  its  supremum.  In  Q,  this  is  not  the  case:  We  can  find  a 
nonempty  subset  of  Q  that  is  bounded  above  but  has  no  supremum  in  Q.  In  other 
words,  there  is  no  completeness  axiom  for  Q.  There  are  simpler  examples  of  sets 
that  are  not  complete  and  we  provide  one  such  example  below. 

Exercise  13.3.  Show  that  if  S  =  [0, 1)  U  (1,2],  there  is  a  nonempty  bounded  subset 
T  of  S  for  which  there  is  no  b  £  S  that  satisfies  both 

(i)  x<b  for  all  x  £  T,  and 

(ii)  if  c  £  S  and  x  <  c  for  all  x  £  T,  then  b  <  c.  O 

There  are  many  subsets  of  R  that  are  complete  and  there  are  many  that  are  not 
complete,  but  it  is  not  our  goal  in  this  chapter  to  discuss  the  completeness  properties 
of  all  subsets  of  R.  Dealing  with  subsets  can  be  a  bit  tricky,  and  we  prefer  not  to 
get  into  the  details  in  this  text.  Instead,  we  will  focus  on  one  of  the  most  important 
subsets  of  R — the  rationals. 

Example  13.4.  Show  that  the  set  Q  is  not  complete;  that  is,  show  that  there  is  a 
nonempty  set  B  of  rational  numbers  that  is  bounded  above,  but  no  rational  number 
b  satisfies  both 

(i)  x  <  b  for  all  x  £  B,  and 

(ii)  if  c  £  Q  and  x  <  c  for  all  x  £  B,  then  b  <  c. 

Proof.  Let  B  =  {x£  Q+  :  x2  <  2}.  Then  B  is  nonempty  (1  £  B )  and  bounded  above 
(2  is  an  upper  bound).  Suppose  to  the  contrary  that  the  rational  number  b  satisfies 
conditions  (i)  and  (ii)  above.  If  such  a  rational  number  b  exists,  it  must  satisfy  one  of 
the  following  three  things:  b  =  y/l,  b>V 2,  or  b  <  \fl.  We’ll  show  that  no  rational 
number  can  satisfy  one  of  these. 

We  know  from  Theorem  5.2  that  \fl  is  not  rational,  and  we  know  from  our  as¬ 
sumptions  that  b  is  rational.  So  h  f  \/2. 

Suppose  that  b  >  \Jl.  By  Theorem  12.12,  there  is  a  rational  number  c  such  that 
\Jl  <  c  <b.  Now  the  supremum  of  the  set  A  =  {x  £  M+  :  x2  <  2}  is,  as  we  have  just 
seen  in  Theorem  13.2,  y/2.  For  every  x  £  B  we  know  that  x  £  A  and  consequently 
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x  <  \J2  <  c.  Thus  c  satisfies  the  hypotheses  of  (ii).  But  c  <  b,  and  therefore  c  does 
not  satisfy  the  conclusion  of  (ii).  This  implies  that  this  case  cannot  occur. 

Now  suppose  that  b  <  \J2.  By  Theorem  12.12,  there  is  a  rational  number  c  with 
b  <  c  <  \[2.  The  right  side  of  this  inequality  (together  with  the  fact  that  c  >  0)  tells 
us  that  c2  <  2  and  therefore  c  £  B.  But  c  >  b,  and  this  contradicts  the  fact  that  b 
satisfies  condition  (i)  above.  This  implies  that  this  case  cannot  occur. 

Thus  we  conclude  that  there  is  no  rational  number  satisfying  conditions  (i)  and 

(ii)  above.  □ 

Exercise  13.5.  We  showed  that  Q  is  not  complete  by  showing  that  when  we  con¬ 
sider  the  set  B  =  {x  £  @+:x2<  2},  there  is  no  b  £  Q  satisfying  (i)  and  (ii)  of 
Example  13.4.  Using  that  result,  show  that  there  is  a  nonempty  bounded  set  C  in  Q 
for  which  there  is  no  u  £  Q  satisfying 

(i)  w>u  for  all  w  £  C,  and 

(ii)  if  z  £  Q  and  w  >z  for  all  w  £C,  then  u  >  z.  O 

By  the  way,  there  is  still  something  “missing”  in  ffi. — the  square  root  of  —  1. 
So  you  might  decide  to  look  at  complex  numbers  . . .  but  that’s  another  story.  If 
it’s  a  story  you  are  interested  in  learning  more  about,  you  can  begin  by  looking  at 
Project  29.6  and  the  references  that  appear  there. 


Definitions 

Definition  13.1  (for  Problems  13.13  through  13.15).  The  relation  ^  on  a  nonempty 
set  S  is  called  a  partial  order  if  the  following  three  conditions  are  satisfied: 

(i)  (Reflexive  property)  For  all  x  £  S,  we  have  x  A  x. 

(ii)  (Transitive  property)  For  all  x,y,z£  S,  if  x  A  y  and  y  A  z,  then  x<z. 

(iii)  (Antisymmetric  property)  For  all  x,y  £  S,  if  x  A  y  and  y  A  x,  then  x  =  y. 

Definition  13.2  (for  Problems  13.13  through  13.15).  The  relation  A  on  a  nonempty 
set  S  is  called  a  total  order  if  it  is  a  partial  order  with  the  following  additional  prop¬ 
erty: 

(iv)  For  all  x,y  £  S,  we  have  x  A  y  or  y  A  x. 


Solutions  to  Exercises 

Solution  (13.1).  Since  c  <  a,  we  know  that  c  ^  supjvv  £  R+  :  w2  <  2}.  Therefore, 
there  exists  w  £  R+  such  that  w2  <  2  and  c  <  w.  Since  c  >  0,  we  have  c 2  <  cw.  Since 
w  >  0,  we  have  cw  <w2.  Putting  this  together,  we  have  c2  <  w2  <  2,  as  desired. 
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Solution  (13.3).  Claim:  If  T  =  [0, 1),  then  there  is  no  element  b  £  S  =  [0. 1 )  IJ  (1,2] 
satisfying  the  stated  conditions  for  this  T :  Suppose  there  were  such  an  element.  We 
cannot  have  b=  1,  because  b  £  S  and  1  ^  S.  Thus,  b  <  1  or  b  >  1. 

Suppose  b  <  1,  then  (b+  l)/2  <  1  and,  since  b  £  S,  we  have  (b+  l)/2  >  0.  So 
(b+  l)/2  £T  and  (b+  l)/2  >  b,  which  contradicts  (i). 

Therefore,  we  must  have  b  >  1.  In  this  case,  we  consider  (b+  l)/2  >  1.  Then 
1  <  (b+l)/2  <  b  <2.  But  then  c=  (b+  1) /2  £  S  and  t  <  c  for  all  t  £  T.  Therefore 
c  satisfies  the  hypothesis  of  (ii),  but  not  the  conclusion  since  c  <  b.  This  completes 
the  proof. 

Solution  (13.5).  We  define  the  set  C  =  {w  £  Q  "*  :  w2  <  2}.  Suppose  to  the  contrary 
that  there  exists  that  satisfies  conditions  (i)  and  (ii)  stated  in  this  exercise.  Let 
b  =  —u  and  note  that  b  £  Q.  We  will  show  that  b  satisfies  conditions  (i)  and  (ii)  of 
Example  13.4. 

If  x  £  B,  we  let  w  =  —  x.  Then  w2  =  (— x)2  =  x2  <2.  Since  x  £  Q+,  we  have 
w  G  (Qp.  Hence  w  £  C.  Condition  (i)  shows  that  u  <  w.  Thus  x  =  —  w  <  —u  =  b. 
Hence  for  all  x  £  B,  we  have  x  <  b.  Therefore,  condition  (i)  of  Example  13.4  is 
satisfied. 

Now  suppose  c  £  Q  and  x  <  c  for  all  x  £  B.  If  w  £  C,  let  x  =  —w.  Note  that 
x2  =  (— w)2  =  w2  <  2  and  x  £  Q+.  Thus,  x  £  B  and  x  <  c.  So  w  =  — x  >  —  c.  Taking 
z  =  —  c  in  condition  (ii),  we  have  shown  that  w  >  z  for  all  w  £  C.  We  conclude  that 
u  >  z ■  Thus  b  =  — u  <  —z  =  c;  that  is,  b  <  c  and  condition  (ii)  of  Example  13.4  is 
satisfied. 

The  existence  of  the  rational  number  b  contradicts  the  result  of  Example  13.4. 
Hence  there  can  be  no  rational  number  u  corresponding  to  the  set  C  that  satisfies  the 
two  conditions  of  this  exercise. 


Problems 

Problem  13.1.  Find  the  supremum  and  infimum  (in  R)  of  the  subset  T  =  [0.  K)  nQ1 
of  M.  Justify  your  answers!  You  may  assume  that  K  is  irrational,  if  that  is  helpful. 

Problem  13.2.  Let  T  be  a  nonempty  subset  of  [0, 1].  Then  T  has  an  infimum  m  and 
a  supremum  M  in  R.  Show  that  m  and  M  belong  to  [0, 1]. 

Problem  13.3.  Consider  the  subset  S  =  R  \  Z  of  R.  Give  an  example  of  a  subset  7) 
of  5  that  has  an  infimum  and  supremum  in  S  and  a  bounded  subset  T2  of  S  that  has 
neither  an  infimum  nor  supremum  in  S. 

Problem  13.4.  Suppose  S  is  a  nonempty  bounded  subset  of  R  and  u  is  the  infimum 
of  5.  Let  v  =  inf{x  £  S  :  x  ^  w}.  Show  that  v  R  u. 

Problem  13.5.  Show  that  the  set  S  =  [0, 1]  DQ  is  not  complete  by  showing  that  there 
is  a  nonempty  bounded  subset  T  of  5  for  which  there  is  no  element  b  £  S  satisfying 


138 


13  Consequences  of  the  Completeness  of  R 


(i)  x  <  b  for  all  x  £  T,  and 

(ii)  if  c  €  S  and  x  <  c  for  all  x  £  T,  then  b  <  c. 


Problem  13.6.  Let  x  and  y  be  two  real  numbers  and  let  S  =  {x.  v}. 

(a)  Prove  that 


max{x,y} 


|x~  y|  +x  +  y 
2 


(b)  Find  a  similar  formula  for  the  minimum. 

(c)  Explain  why  these  formulas  also  yield  formulas  for  inf  S  and  sup  S. 


Problem  13.7.  If  A  and  B  are  subsets  of  R  for  which  sup, 4  =  sup/i  and  infA  =  inffi, 
must  A  =  B2  Either  prove  this  or  give  a  counterexample. 


Problem  13.8.  Suppose  that  A  and  B  are  disjoint  bounded  intervals  in  R.  Prove  that 
infA  >  sup/i  or  iul  /i  >  supA.  You  may  use  the  fact  that  if  a\  .ao  £  A  with  a\  <  aj, 
then  the  interval  [01,02]  C  A.  (Of  course,  you  may  use  the  corresponding  fact  for  B.) 

Problem  13.9.  For  each  of  the  following,  either  prove  the  statement  or  provide  a 
counterexample. 

(a)  Let  S  be  a  subset  of  R  consisting  of  20  positive  integers.  Then  S  has  a  supre- 
mum  U  and  an  infimum  u  and  both  11  and  U  belong  to  S. 

(b)  Suppose  that  S  is  a  nonempty  subset  of  R  and  S  has  a  supremum  U.  Let 
T  =  {x  £  S  :  x  <U}.  Then  T  =  S. 

(c)  Suppose  S  is  a  nonempty  bounded  subset  of  R  and  U  =  sup  S.  Suppose  further 
that  there  exists  anx  £  S  withx  <  U.  Let  v  =  sup{x  £  S :  x  <  U}.  Then  v  <  U. 

Problem  13.10.  In  the  proof  of  Theorem  13.2,  we  let  y  =  (2  —  a2)  / (3a)  for  the  case 
a2  <  2. 

Explain  this  choice  of  y  along  the  lines  of  the  explanation  for  the  choice  of  x 
following  the  proof  of  Theorem  13.2.  (We  need  a  real  number  y  satisfying  0  <y<a 
and  (a  +y)2  <  2.  Solve  a  simpler  problem.) 


Problem  13.11.  Prove  that  if  a  is  a  rational  number,  then  there  is  an  irrational  num¬ 
ber  b  such  that  a  <  b. 


Problem  13.12.  Prove  that  for  two  arbitrary  real  numbers  a  and  b  with  a  <  b,  there 
is  an  irrational  number  c  such  that  a  <c  <  b.  (Hint:  Consider  a/\] 2  and  b/y/ 2.) 

Problem  13.13.  Let  R  denote  a  relation  on  a  nonempty  set  S.  The  relation  R  is 
called  a  partial  order  if  the  following  three  conditions  are  satisfied. 

(i)  (Reflexive  property)  For  all  x  £  .S',  we  have  x  R  x. 

(ii)  (Transitive  property)  For  all  x,y,z  £  S,  if  x  R  y  and  y  R  z,  then  x  R  z. 

(iii)  (Antisymmetric  property)  For  all  x,y  £  S,  if  x  R  y  and  y  R  x,  then  x  =  y. 

The  relation  R  is  a  total  order  on  the  nonempty  set  S  if,  in  addition,  (iv)  below 
is  satisfied. 
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(iv)  For  all  x,y  £  S,  we  have  x  A  y  or  y  A  x. 

The  last  condition  says  that  we  can  always  compare  two  elements  x  and  y  in  S 
using  the  relation.  (A  partially  ordered  set  is  sometimes  called  a  poset.  It’s  much 
less  common  to  call  a  totally  ordered  set  a  toset.) 

Show  that  the  usual  <  relation  defines  a  total  order  on  R. 

Problem  13.14.  This  problem  uses  the  definitions  introduced  in  Problem  13.13. 

Let  A  be  a  set  containing  at  least  two  elements.  We  define  an  order  on  38(A) 
using  set  inclusion  C.  Show  that  C  is  a  partial  order,  but  not  a  total  order  on  38(A). 

Problem  13.15.  This  problem  uses  the  definitions  introduced  in  Problem  13.13. 
Consider  the  relation  <  on  R.  Show  that  this  is  not  a  total  order  by  exhibiting  coun¬ 
terexamples  for  each  property  ((i)— (iv))  violated  in  the  definition  of  total  order. 

Problem  13.16.  Prove  that  there  exists  x  £  R+  with  x2  =  3. 

Problem  13.17.  Prove  that  there  is  no  x  £  Q  with  x2  =  12,  but  there  is  an  x  £  E 
such  thatx2  =  12.  (You  might  find  Problem  13.16  helpful.) 

Problem  13.18.  In  Problem  13. 14  you  showed  that  (&(Z),C)  is  a  partial  order.  For 
every  nonempty  subset  sf  of  &(Z)  we  say  that  U  £  38  (Z)  is  an  upper  set  of  s3 ,  if 
X  C  U  for  all  X  £  ,<// .  A  nonempty  set  ,<//  C  83  (Z)  will  be  called  an  upper  bounded 
set  if  there  is  an  upper  set  of  in  38{Z3).  We  say  Uq  £  88(Z)  is  a  least  upper  set  if 
(i)  f/o  is  an  upper  set  of  and  (ii)  if  U  is  another  upper  set  of  ,  then  f/o  C  U. 

(a)  Let  ^  =  {{1,2, 5, 7},  {2, 8, 10},  {2, 5, 8}}.  Show  that  38  is  an  upper  bounded 
set  and  find  a  least  upper  set  of  38,  if  there  is  one. 

(b)  Prove  that  every  nonempty  subset  of  88  (Z)  is  upper  bounded. 

(c)  Define  “lower  set,”  “lower  bounded  set,”  and  “greatest  lower  set.” 

(d)  Let  be  a  nonempty  subset  of  /Y’fZ).  Using  union  and  intersection,  find  an 
expression  for  least  upper  set  of  .c/  and  greatest  lower  set  of  . 

(e)  Prove  that  (^(Z) ,  C)  has  the  “least  upper  set  property”  (in  other  words,  show 
every  upper  bounded  set  has  a  least  upper  set). 

Problem  13.19.  Prove  that  (Z,  <)  is  complete  in  the  following  sense:  If  A  is  a 
nonempty  set  of  integers  that  is  bounded  above,  then  there  is  an  integer  a  such  that 
a  =  supA. 

Problem  13.20.  Let  S  be  a  nonempty  subset  of  R,  and  x  £  R.  Suppose  that  there 
exists  y  £  S  with  y  <  x.  Let  yo  =  sup{y  £  S  :  y  <  x}. 

(a)  Give  an  example  of  such  a  set  S  and  real  number  x. 

(b)  Show  that  yo  <  x. 

(c)  Give  an  example  to  show  that  yo  may  equal  x. 

(d)  Give  an  example  to  show  that  yo  may  be  strictly  less  than  x. 
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Problem  13.21.  Prove  the  following  statement:  For  every  positive  irrational  real 
number  a  with  \Ja  <  10000,  there  is  a  positive  integer  n  such  that 

10000  n  10000 

- <  sfa  < - t-. 

n  n  —  1 

Problem#  13.22.  To  practice  some  of  the  more  challenging  theorems  in  this  and  the 
previous  chapter,  we  use  some  of  the  techniques  we  have  discussed  to  establish  the 
division  algorithm  in  Z.  We  wish  to  prove: 

Theorem  13.6  (Division  algorithm).  Let  m  and  n  be  integers  with  n  /  0.  Then  there 
exist  q,  r  £  Z  satisfying  m  =  nq  +  r,  where  0  <  r  <  \n\  and  q  and  r  are  unique. 

We  outline  the  proof  below.  You  should  complete  the  details. 

(a)  Let  A  =  {m  —  nx  :  x  £  Z  and  m  —  nx  >  0}.  Show  that  A  is  nonempty. 

(b)  Explain  why  there  exists  r  €  Z  such  that  r  =  min  A. 

(c)  Explain  why  0  <  r  <  \n \ . 

(d)  Use  your  work  above  to  prove  that  there  are  q,  r  £  Z  such  that  m  =  nq+r  and 
0  <  r  <  \n\. 

(e)  We  now  need  to  show  uniqueness.  Recall  that  to  show  q  and  r  are  unique,  we 
will  assume  that  there  exist  q\  and  r\  satisfying  several  properties.  Write  out 
the  properties. 

(f)  Show  that  |/i|  |  (r  —  ri)  and  that  —  \n\  <  r  —  r\  <  \n\. 

(g)  Finish  the  proof. 


Tips:  You  Solved  It.  Now  What? 

Let’s  say  we  are  now  at  the  point  where  you  solved  the  given  problem  and  wrote  up 

your  first  draft. 

•  Look  over  your  solution.  Does  it  use  everything  you  are  given? 

•  Is  the  answer  reasonable? 

•  If  there  were  places  where  you  were  unsure  of  your  argument,  check  over  those 
arguments  carefully.  You  might  find  it  helpful  to  write  the  solution,  take  a  break, 
and  then  check  the  solution.  (We  say  that  a  lot,  don’t  we?) 

•  Is  your  argument  clear?  Did  you  choose  your  notation  well?  Is  your  notation 
clear  and  unambiguous?  Did  you  introduce  all  notation  before  you  used  it? 

•  Is  there  a  shorter  or  more  intuitive  argument? 

•  Do  you  fully  understand  what  you  did?  Spend  some  time  thinking  about  the 
method  and  what  you  proved.  Could  you  have  gotten  a  better  result? 

•  When  do  these  methods  work?  What  are  the  restrictions?  Where  have  you  seen 
them  before? 

•  If  the  problem  was  hard  for  you  to  solve,  what  made  it  hard?  What  were  the 
important  ideas  that  you  were  missing? 
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•  This  is  a  good  opportunity  to  learn  about  yourself,  too.  Which  problems  do  you 
like  best?  Why? 


Chapter  14 

Functions,  Domain,  and  Range 


What  is  a  function?  You’ve  probably  gotten  a  definition  of  this  somewhere  along  the 
way.  We  will  state  the  definition  of  function  in  terms  of  relations,  which  is  probably 
different  than  the  way  you  have  seen  it  stated. 

Let  A  and  B  be  sets.  A  function  f  from  A  to  B  is  a  relation  /  from  A  to  B 
satisfying 

(i)  for  all  a  £  A,  there  exists  b  £  B  such  that  (a,  b)  £  /,  and 

(ii)  for  all  a  £  A,  and  all  b,c  £  B,  if  (a,b)  £  /  and  (a,c)  £  /,  then  b  =  c. 

A  function  is  often  called  a  map  or  mapping.  We  usually  write  /  :  A  — >  B  to  in¬ 
dicate  that  /  is  a  function  from  A  to  B.  The  sets  A  and  B  may  be  explicitly  identified, 
but  they  are  often  understood  from  the  context.  (See  Problem  14.4.) 

When  we  know  what  the  two  sets  are  and  that  the  two  conditions  are  satisfied, 
we  say  /  is  a  well-defined  function.  If  the  object  we  try  to  define  does  not  satisfy 
these  properties,  it  isn’t  a  function,  and  we  often  say  that  /  (which  we  shouldn’t  call 
a  function)  is  not  well-defined. 

Condition  (i)  makes  sure  that  each  element  in  A  is  related  to  some  element  of 
B,  while  condition  (ii)  makes  sure  that  no  element  of  A  is  related  to  more  than  one 
element  of  B.  Note  that  it  may  be  the  case  that  an  element  of  B  has  no  element  of  A 
to  which  it  is  related;  or  an  element  of  B  could  be  related  to  more  than  one  element 
of  A.  The  set  A  is  called  the  domain,  and  denoted  by  dom(/),  and  the  set  B  is  called 
the  codomain,  and  denoted  cod(/). 

As  a  first  example,  consider  the  function  that  assigns  to  a  citizen  of  the  United 
States  his  or  her  height  measured  in  inches  on  a  particular  day  at  a  particular  time. 
We’ll  assume  that  people  are  at  most  20  feet  tall.  This  is  a  function  because  we  have 
a  domain  (the  set  of  the  citizens  of  the  United  States  on  a  particular  day  at  a  particu¬ 
lar  time),  a  codomain  (the  set  of  real  numbers  between  0  and  240  inches),  condition 
(i)  (each  person  has  a  height),  and  condition  (ii)  (each  person  has  exactly  one  height 
on  that  day,  at  that  time).  Now  let’s  turn  to  a  nonexample.  We  still  consider  the  do¬ 
main  to  be  the  set  of  citizens  of  the  United  States,  but  this  time  let  the  codomain 
consist  of  all  the  countries  in  the  world  (on  a  particular  day,  at  a  particular  time). 
Consider  the  relation  that  assigns  to  each  person  in  the  domain  his  or  her  country 
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Fig.  14.1 


(countries)  of  citizenship.  This  is  not  a  function  because  a  United  States  citizen  can 
be  a  citizen  of  more  than  one  country.  Though  (i)  is  satisfied  because  each  person  in 
the  domain  is  a  U.S.  citizen,  (ii)  is  not. 

Exercise  14.1.  Let  A  =  {1,2,3}  and  B  =  {2,4,6}.  Which  of  the  following  are  func¬ 
tions  from  A  to  B  !  If  they  are  not  functions,  explain  which  rule  is  violated. 

(a)  The  relation  /  is  {(1,2),  (2,4),  (3,4)}. 

(b)  The  relation /is  {(1,2),  (1,4),  (2, 2),  (3, 6)}. 

(c)  The  relation  /  is  {(1,2),  (3,4)}. 

(d)  The  relation /is  {(2,4),  (1,2),  (3,6)}.  O 

Exercise  14.2.  You  probably  learned  that  a  function  /  :  K.  — >  K.  can  be  represented 
by  a  graph,  and  that  there  is  a  vertical  line  test  to  determine  whether  or  not  /  is 
a  function  (see  Figure  14.1).  Which  condition  in  the  definition  corresponds  to  the 
vertical  line  test?  Why?  O 

In  Exercise  14.1,  you  probably  recognized  (d)  as  a  function  from  A  to  B.  It  is 
more  usual  to  write  /( 1)  =  2,  /( 2)  =  4,  and  /( 3)  =  6.  Since  each  x  in  the  domain  is 
related  to  a  unique  y  in  the  codomain,  we  will  write  /(x )  =  y  rather  than  (x,y)  £  /. 

Exercise  14.3.  Rewrite  the  definition  of  a  function  using  the  notation  introduced  in 
the  paragraph  above.  Q 


Here  are  some  more  examples  and  nonexamples  of  functions. 

Exercise  14.4.  Decide  which  of  the  following  are  functions  and  which  are  not,  giv¬ 
ing  reasons  for  your  answers. 

(a)  Let  / :  R  — >  K.  be  defined  by  /(x)  =  3x  +  2. 

(b)  Let  / :  R  — >  R  be  defined  by  f(x)  =  1  /x2. 

(c)  Let  / :  R  — >  R2  be  defined  by  /(x)  =  (x,x). 

(d)  Let  /  :  Q  —>■  Q  be  defined  by  f{p/q)  =  1  /q,  where  p  and  q  are  integers  and 

q  7^o. 

(e)  Let  /  :  M2  — >  R2  be  defined  by  f(x,y)  =  (x, 3). 


O 
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Fig.  14.2  The  graph  on  the  left  is  of  /,  the  one  on  the  right  is  of  g 

You  have  seen  many  examples  of  functions.  One  particular  type  of  example,  that 
of  a  function  defined  in  cases,  allows  us  to  explicitly  illustrate  many  of  the  ideas 
discussed  in  this  section.  Before  you  begin  working  with  a  function  that  is  defined 
in  cases,  make  sure  that  you  understand  the  function.  If  you  can,  graph  it.  Remember 
that  the  best  thing  to  do  is  to  work  with  concrete  objects  (like  trying  x  =  2  or  x  =  —  3) 
until  you  get  a  feel  for  what  is  happening.  For  functions  that  are  defined  in  cases  we 
have  to  be  particularly  careful  to  check  that  the  cases  don’t  overlap;  or  if  they  do, 
that  the  function  is  defined  in  a  unique  way  for  all  the  elements  in  the  domain  that 
are  in  the  overlap.  Of  course,  we  are  not  changing  the  rules  here.  All  you  really  have 
to  do  is  check  that  you  know  what  the  domain  and  codomain  are,  and  that  conditions 
(i)  and  (ii)  of  the  definition  hold.  Here  are  some  examples. 

Example  14.5.  We  will  check  to  see  whether  each  of  the  objects  defined  below  and 
graphed  in  Figure  14.2  is  a  function. 

(a)  Let  /  :  R  — >  R  be  defined  by 


(b)  Let  g  :  R  — >  R  be  defined  by 


0  if  x  >  2 
l/x  if  x  <  3 


For  (a)  note  that  the  domain  is  R  and  the  codomain  is  also  R.  From  the  definition 
of  /  it  is  easy  to  see  that  /  is  defined  for  all  x  £  R.  Hence  condition  (i)  of  the 
definition  of  a  function  holds. 

Now  let  a  £  R  and  suppose  that  there  exist  real  numbers  b  and  c  with  f(a)  =  b 
and  f{a)  =  c.  The  most  orderly  way  to  check  condition  (ii)  is  the  following:  If  a  <  0, 
then  b  =  f{a)  =  a 2  and  c  =  f(a )  =  a2,  so  b  =  c.  If  a  >  0,  then  b  =  f(a)  =2a+l 
and  c  =  f[a)  =2a+\.  Hence  b  =  c.  In  either  case,  b  =  c.  So  condition  (ii)  holds. 
Since  both  (i)  and  (ii)  are  satisfied,  /  is  well-defined. 
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The  formula  given  in  part  (b)  does  not  define  a  function  for  two  reasons.  First 
note  that  0  is  in  the  domain.  Since  0  <  3,  we  see  that  g( 0)  is  not  defined  to  be  an 
element  in  the  codomain.  Hence  condition  (i)  is  violated,  and  we  conclude  that  g  is 
not  a  function. 

We  mention  here  that  there  is  a  second  problem  with  the  definition  of  g:  consider 
a  real  number  a  such  that  2  <  a  <  3.  For  instance,  let  a  =  2.5.  Then  g( 2.5)  =  0 
(since  2.5  >  2)  and  g(2.5)  =2/5  (since  2.5  <  3).  This  violates  condition  (ii)  of  the 
definition  of  a  function.  Thus  g  does  not  satisfy  condition  (i)  or  (ii).  Therefore,  the 
object  defined  above  is  not  a  function,  for  two  reasons. 

Although  the  example  above  violates  both  (i)  and  (ii),  keep  in  mind  that  it  is 
enough  that  (i)  or  (ii)  alone  be  violated  to  ensure  that  /  is  not  a  function.  Q 


Exercise  14.6.  For  each  of  the  two  examples  below  decide  whether  or  not  the  object 
so  defined  is  a  function.  Give  reasons  for  your  answers. 


(a)  Let  / :  R  — >  R  be  defined  by 

f(x) 

(b)  Let  / :  Z  — >  Z  be  defined  by 


x2  if  x  >  0 
—  (x2)  if  x  <  0 


|  1  if  x  €  2Z 

/(x)  =  <  2  if  x  is  prime  . 
I  3  otherwise 


o 


One  very  important  example  of  a  function  defined  in  cases  is  the  familiar  absolute 
value  function. 

Example  14.7.  The  absolute  value  function  /  :  R  — >  R  is  defined  by  /(x)  =  |x|.  It  is 
easy  to  check  that  this  does  define  a  function  on  R.  O 

When  you  define  a  new  mathematical  concept,  it’s  always  a  good  idea  to  think 
about  it  and  pose  questions.  Of  course,  it’s  also  a  good  idea  to  answer  those  ques¬ 
tions,  if  you  can.  We  now  turn  to  some  questions  that  we  find  interesting.  See  if  you 
can  think  of  some  questions  on  your  own. 

What  does  it  mean  to  say  that  two  functions  / :  A  — >  B  and  g  :  A  — >  B  are  equal? 
Since  this  is  a  very  important  concept  that  we  will  need  again  later,  we  provide  the 
answer  here.  But  try  to  think  about  how  this  answer  follows  from  the  definition  of  a 
function. 

Two  functions  /  :  A  — >  B  and  g  :  A  — >  B  are  equal  if  and  only  if  /(x)  =  g(x) 

for  all  x  G  A. 

Here’s  a  second  question:  What  is  the  function’s  relationship  to  elements  of  the 
domain,  and  how  does  this  differ  from  the  function’s  relationship  to  elements  of  the 
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Fig.  14.3 


codomain?  We  must  be  able  to  evaluate  /  for  every  element  in  the  domain,  while 
elements  in  cod (/)  may  or  may  not  be  associated  with  elements  of  the  domain. 
The  elements  of  the  codomain  that  are  related  to  elements  of  dom(/)  are  obviously 
important  in  understanding  the  function.  For  this  reason,  we  look  at  the  set  called 
the  range  of  /,  which  consists  precisely  of  these  points. 

Given  a  function  f  :  A— y  B,  the  range  of/,  denoted  ran(/),  is  defined  by 

ran(/)  =  there  exists  at  least  one  a  £  A  such  that  f(a)  =  b}. 

Sometimes  it  is  fairly  easy  to  determine  the  range,  but  it  generally  requires  a 
method  (demonstrated  below)  that  we  think  of  as  working  backwards.  You’ll  start 
with  b  £  B  and  try  to  find  a  £A.  Then,  to  show  that  b  £  ran(/),  you  have  two  things 
to  check:  The  element  a  must  map  to  b  under  /  (that  is,  f(a)  =  b),  and  a  must  be  an 
element  of  A.  This  latter  statement  is  often  obvious,  but  don’t  forget  to  check  it! 

It’s  always  easier  to  start  with  small  sets  and  see  if  you  understand  what  is 
happening.  You  can  do  this  visually  as  well.  For  example,  say  A  =  {1,2,3}, 
B  =  {2,3,4},  and  the  function  /  :  A  — >  B  is  defined  by  /( 1)  =  3,/(2)  =  3,  and 
/( 3)  =  4.  We  can  “see”  the  action  of  /  by  drawing  a  little  picture  as  in  Figure  14.3 

From  this  picture  we  can  see  easily  that  /  sends  two  things  to  3,  one  thing  to  4, 
and  nothing  to  2.  So  we  can  “see”  that  though  2  is  in  the  codomain,  it  is  not  in  the 
range.  If  you  are  asked  for  examples  or  counterexamples,  remember  that  small  sets 
will  sometimes  do  the  trick! 

Our  next  example  is  really  a  method.  Once  we  complete  the  example,  we  will 
review  exactly  what  you  must  do  in  similar  circumstances.  But  one  thing  we  will 
mention  in  advance:  you  will  always  need  to  devise  a  plan  as  we  do  below. 

Example  14.8.  Let  / :  R\  {3}  — » Rbe  defined  by  f{x)  =  (x+  l)/(x  —  3).  Determine 
the  range  of  /. 

“Devising  a  plan.”  We  need  to  figure  out  which  y  £  R  come  from  something 
under  /.  It’s  a  bit  difficult  to  simply  gaze  at  /,  or  even  the  graph  of  /,  and  see  what 
comes  out  of  it,  so  we’ll  try  working  backwards  to  see  what  y  might  be.  (Though 
the  graph  of  /  provides  a  good  way  to  see  if  your  answer  is  reasonable,  it  does  not 
provide  a  proof.)  So,  suppose  y  £  R  did  come  from  something  in  the  domain.  That 
would  mean 


y  =  f(x),  for  some  x  £  R\  {3}; 
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in  other  words,  y  =  (x  +  l)/(x  —  3).  Since  we  need  to  figure  out  what  x  is,  we  should 
solve  for  it.  Multiplying  through  by  x  —  3,  we  get  (x  +  1 )  =  yx—3y.  Collecting  all 
terms  involving  x  yields  x  —  yx=  -  3y  —  1.  Factoring  out  x,  dividing,  simplifying, 
and  ignoring  potential  problems  (like  what?),  we  get  x  =  (3y+  l)/(y  —  1).  So  if  y 
came  from  some  x  at  all,  y  had  to  come  from  x  =  (3y  +  1  )/(y  —  1 ) .  That’s  fine,  as 
long  as  y  ^  1  (that  was  a  potential  problem).  So  ran (/)  “appears  to  be” 

The  reason  for  saying  “appears  to  be”  is  that  we  started  by  assuming  y  came  from 
something  called  x,  and  then  found  out  what  x  had  to  be.  But  the  definition  of  range 
really  requires  us  to  start  with  an  x  and  show  that  /(x)  =  y.  So  we  need  to  check  that 
everything  we  did  above  is  reversible,  and  that  the  two  sets  ran(/)  and  R\  {1}  are 
equal.  All  of  this  was  helpful  in  deciding  what  the  range  is,  but  the  actual  proof  is 
still  to  come.  The  proof  below  is  the  form  you  should  follow.  When  we  write  it,  we 
need  to  pretend  the  reader  has  not  seen  the  work  we  just  completed. 

Proof.  We  will  show  that  ran(/)  =  R  \  {1}.  Let  y  £  ran (/).  Then,  clearly,  y  £  R. 
So  ran(/)  C  R.  To  show  that  y  ^  1,  suppose  that  this  is  not  the  case;  so  we  will 
suppose  y  =  1  £  ran (/)  and  see  what  happens.  Since  y  £  ran (/),  there  exists  a  point 
x  in  the  domain  with  /(x)  =  y  =  1.  Using  the  definition  of  /,  we  find  that  1  =  /(x) 
=  (x+l)/(x  —  3).  Therefore,  x  +  1  =  x  —  3.  This  would  mean  that  1  =  —3,  which  is 
not  possible.  So  y  £  ran  (/)  implies  ygt  and  yf=l.  Thus,  ran  (/)  C  R  \  { 1 }. 

Now  let  y  £  M\  {1}.  Let  x  =  (3y+  l)/(y—  1).  Since  y  1,  we  see  that  x  £  R. 
Remember  that  we  need  to  check  that  x  £  dom (/).  We  know  that  x  £  1.  Could 
we  possibly  have  x  =  3?  Suppose  we  do,  then  3  =  (3y+  l)/(y—  1)  which  implies 
3y  —  3  =  3y  +  1.  Thus  we  would  have  —3  =  1,  which  is  impossible.  So  x  £  dom (/) 
and  we  can  evaluate  /  at  x  to  obtain 

^  Mj1  -3  ~  3y  +  1  —  3y  +  3  _;V' 

It  follows  that  R  \  { 1 }  C  ran (/) .  Therefore  ran(/)  =  R  \  { 1 },  completing  the  proof. 

□ 

Before  going  on,  we  will  make  two  remarks.  If  you  hadn’t  read  “Devising  a 
plan”  above  the  proof,  the  definition  of  x  =  (3y+  l)/(y  —  1)  would  probably  look 
bizarre.  Remember  that  we  didn’t  guess  it;  we  worked  backwards  to  see  what  x  had 
to  be.  One  other  thing  to  note  is  that  ran(/)  ^  R,  but  ran(/)  =  R \  { 1}.  So  /  maps 
into  R  but  it  doesn’t  “hit”  the  value  1.  We’ll  come  back  to  this  in  the  next  chapter. 

o 

So  what  must  we  do  when  we  have  to  find  the  range  of  a  function?  First,  we  need 
to  take  out  a  different  sheet  of  paper  and  figure  out  what  the  set  should  be.  Let’s  say 
we  decide  the  range  is  a  set  called  B.  Then  we  need  to  show  the  reader  that  the  two 
sets  are  equal.  There  are  often  many  ways  to  do  it,  but  one  way  is  to  start  with  an 
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element  in  the  range  (tell  the  reader  you  are  doing  this)  and  show  it  is  in  B.  Then  start 
with  an  element  y  in  B  (tell  the  reader  you  are  doing  this,  too)  and  find  an  x  (which 
you  found  somewhere  else,  but  the  reader  doesn’t  necessarily  need  to  see  that)  that 
satisfies  two  things:  x  is  in  the  domain  of  your  function  and  f(x)  =  y.  Write  your 
proof  up  carefully,  identifying  variables  before  you  use  them,  and  always  checking 
that  your  variables  are  in  the  appropriate  sets. 

Exercise  14.9.  What  is  the  range  of  each  of  the  functions  below?  A  picture,  when 
appropriate,  is  a  lovely  addition  and  is  heartily  encouraged.  It  does  not,  however. 


substitute  for  the  real  thing.  Write  out  everything  explicitly. 


(a)  The  function  / :  R  \  {0}  — >  R  defined  by  f(x)  =  1  /x. 

(b)  The  function  / :  Z  x  (Z\  {0})  — >  M  defined  by  f(x,y)  =  x/y. 

(c)  The  function  / :  R  — >  R  defined  by  f(x)  =  x2  +  4x  +  5. 


O 


Definitions 

Definition  14.1.  Let  A  and  B  be  sets.  A  function  f  from  A  to  B  is  a  relation  /  from 
A  to  B  satisfying 

(i)  for  all  a  £  A,  there  exists  b  £  B  such  that  ( a,b )  £  /,  and 

(ii)  for  all  a  £  A,  and  all  b,c  £  B,  if  ( a,b )  £  f  and  (a,c)  £  /,  then  b  =  c. 

The  standard  notation  is  /  :  A  — >  B  and  b  =  f{a)  for  (a.b)  £  f.  A  function  is  also 
called  a  map  or  a  mapping. 

Definition  14.2.  Given  a  function  /  :  A  — >  B,  the  set  A  is  called  the  domain  of 
/,  denoted  by  dom(/),  and  the  set  B  is  called  the  codomain  of  /,  and  denoted  by 


cod(/). 


Definition  14.3.  Given  a  function  /  :  A  — >  B,  the  range  of  /,  denoted  ran(/),  is 
defined  by 

ran(/)  =  {b  £  B  :  there  exists  at  least  one  a  £  A  such  that  f{a)  =  b}. 

Definition  14.4  (for  Problems  14.6  through  14.9).  Let  A  be  a  nonempty  set  and  let 
A  be  a  subset  of  X.  The  characteristic  function  or  indicator  function  of  the  set  A 

in  X  is 


Definition  14.5  (for  Problems  14.7  through  14.9).  Let  I  be  an  interval  of  R  (open, 
closed,  half  open,  or  all  of  R).  A  step  function  is  a  function  /  :  I  — >  R  of  the  form 


n 


f=  Y,akXAk, 


k=l 
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where,  for  k=  1 .....  n,  we  have  ai-  £  R  and  intervals  (any  type,  including  I  itself 
or  containing  just  one  point)  such  that  [Af,  :  k  £  { 1 }  forms  a  partition  of  I 
and  has  the  additional  property  that  Aj  D  A(  =  0  if  j  ^  l. 

Definition  14.6  (for  Problems  14.11  through  14.13).  For  x £  M  we  define  the 
greatest  integer  of  x  by  [xj  =  n,  where  n  £  Z  and  n  <  x  <  n  +  1.  The  greatest 
integer  function  or  floor  function  is  the  function  /  :  R  — >  Z,  defined  by  f(x)  =  |_x| . 


Solutions  to  Exercises 

Solution  (14.1).  The  relations  in  (a)  and  (d)  are  functions,  those  in  (b)  and  (c)  are 
not. 

Solution  (14.2).  Condition  (ii)  corresponds  to  the  vertical  line  test,  since  it  says  that 
if  we  draw  the  vertical  line  x  =  a,  it  should  pass  through  the  graph  of  /  at  most  once. 

Solution  (14.3).  A  function  f  from  a  set  A  to  a  set  B  is  a  relation  /  :  A  — >  B 

satisfying 

(i)  for  all  a  £  A,  there  exists  b  £  B  such  that  f(a)  =  b,  and 

(ii)  for  all  a  £  A,  and  all  b7c  £  B,  if  f(a)  =  b  and  f(a)  =  c,  then  b  =  c. 

Solution  (14.4).  Parts  (a),  (c),  and  (e)  define  functions.  The  others  do  not.  In  (b), 
we  have  not  defined  /( 0)  as  an  element  of  R.  In  (d)  note  that  if,  for  example,  we 
consider  a  =  2/1  =  4/2,  then  /(2/1)  =  1,  while  /( 4/2)  =  1/2.  Thus  (2,1)  and 
(2, 1  /2)  are  both  elements  of  the  relation  and  condition  (ii)  is  violated. 

Solution  (14.6).  For  part  (a),  the  domain  and  codomain  are  both  R  and  both  con¬ 
ditions  in  the  definition  of  a  function  are  satisfied.  Note  that  though  x  =  0  appears 
twice  in  the  definition  of  /,  in  both  cases  /( 0)  =  0.  For  part  (b),  consider  x  =  2.  Since 
x  =  2  £  2Z,  we  have  /( 2)  =  1 .  On  the  other  hand,  2  is  also  prime,  so  /( 2)  =  2.  Thus 
(2, 1)  and  (2,2)  are  both  elements  of  the  relation,  but  1  /  2,  and  condition  (ii)  is 
violated. 

Solution  (14.9).  For  (a)  we  claim  that  ran(/)  =  R\  {0}.  Clearly,  ran(/)  C  R\  {0}. 
So  suppose  that  ygR\  {0}.  Let  x  =  1/y.  Then  x  £  R  and  x  ^  0.  Thus,  x  £  dom(/). 
Furthermore,  f(x)  =  1/(1  /y)=y.  Therefore,  y  £  ran(/)  andR\{0}  C  ran (/),  com¬ 
pleting  the  proof. 

For  (b)  we  claim  that  ran(/)  =  Q.  If  z  £  ran (/),  then  there  exists  (x,y)  £  dom(/) 
=  Zx(Z  \  {0})  with  z  =  f(x,y)  =x/y  £  Q.  Thus  ran(/)  C  Q.  Conversely,  if  z  £  Q, 
then  z  =  p/q  for  some  p,q  £  Z  and  q  ^  0.  Hence  ( p7q )  £  dom(/)  and  f(p,q ) 
=  p/q  =  z.  Thus  z  £  ran (/)  and  Q  C  ran (/).  The  two  parts  together  establish  the 
claim. 

For  (c)  we  claim  that  ran (/)  =  {z  £  R  :  z  >  1}.  (We  went  to  another  sheet  of 
paper  to  come  up  with  this  claim.  A  sketch  (see  Figure  14.4)  is  also  helpful  here, 
but  it  is  not  a  proof.) 
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y 


-5  -4  -3  -2  -1 


Fig.  14.4  f(x)  =  x1  +  4.x  +  5 

Proof.  First  note  that  if  y  G  ran(/) ,  then  there  exists  rgR  such  that  y  =  x2  +  4x  +  5 . 
Completing  the  square,  we  get  y  =  (x  +  2)2  +  1.  Since  (x  +  2)2  >  0,  we  see  that 
y  >  1.  Therefore,  y  G  {z  £  K  :  z  >  1},  and  hence  ran(/)  C  {z  £  ffi.  :  z  >  1}. 

Now  suppose  thaty  £  {z  £  K. :  z  >  1}-  Letx=  y/y  —1—2.  (We  worked  backwards 
to  get  this,  of  course.)  Since  y  >  1,  we  have  x  £  I.  So  x  £  dom(/).  Furthermore, 
/(x)  =  f(y/y-  1  —  2)  =  ( y/y —  1  —  2)2  +4(y'y  —  1  —2)  +5.  Thus  (as  the  reader  can 
check)  /(x)  =  y  —  1  —  4 y/y—  1  +4  +  4 yjy—  1  —  8  +  5  =  y.  Therefore,  y  G  ran(/)  and 
{z£M:z>  1}  C  ran  (/),  as  desired.  □ 


Spotlight:  The  Definition  of  Function 

It’s  probably  difficult  to  imagine  that  there  could  be  any  debate  about  the  definition 
of  function.  In  fact,  the  development  of  the  definition  of  function  is  quite  interest¬ 
ing.  For  example,  Leonhard  Euler  first  defined  a  function  as  follows  [94,  p.  72]:  “A 
function  of  a  variable  quantity  is  an  analytical  expression  composed  in  any  manner 
from  that  variable  quantity  and  numbers  or  constant  quantities.”  Euler  later  revised 
his  definition  because  of  work  on  a  problem  known  as  the  vibrating  string  prob¬ 
lem.  Discussion  ensued,  and  Dirichlet  is  now  often  credited  with  providing  us  with 
roughly  the  definition  we  use  today. 

Once  this  discussion  appeared  to  be  settled,  people  could  then  concentrate  on 
studying  various  kinds  of  functions;  including,  for  example,  continuous,  discontin¬ 
uous,  differentiable,  or  even  nowhere  differentiable  functions.  Dirichlet  also  intro¬ 
duced  the  following  example  (now  called  the  Dirichlet  function): 


where  c  and  d  are  distinct  real  numbers.  This  was  the  first  example  of  many  things, 
including  the  first  example  of  a  function  that  is  discontinuous  everywhere  (see  [58]). 
In  a  very  interesting  article  written  around  1940  (or,  rather,  the  English  translation  of 
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this  article),  Luzin  points  out  that  not  everyone  agreed  that  Dirichlet  had  completely 
answered  the  question  of  what  a  function  is.  According  to  Luzin  [65,  p.  263],  some 
mathematicians  found  the  definition  perfect,  others  found  it  too  broad,  and  still  oth¬ 
ers  found  it  meaningless.  Even  as  late  as  1928,  Hermann  Weyl  [109,  p.  22]  stated 
that  no  one  can  explain  what  a  function  is;  then  Weyl  finishes  the  paragraph  by 
telling  us  what  a  function  is:  “A  function  is  given  if  by  some  definite  rule  to  each 
real  number  a  there  is  assigned  a  real  number  b  (as  e.g.  by  the  formula  b  =  2a  +  1). 
One  then  says  that  b  is  the  value  of  the  function  /  for  the  value  a  of  the  argument.”1 

For  an  overview  of  the  definition  of  the  concept  of  function,  we  recommend 
Riithing’s  entertaining  paper  [94],  where  definitions  (from  1718  to  1939)  attributed 
to  various  authors  are  presented  in  their  original  language,  with  translation  and  with¬ 
out  comment.  You  will  notice  that  the  final  definition,  due  to  N.  Bourbaki  and  given 
in  1939,  agrees  with  our  definition. 

The  history  of  the  vibrating  string  problem  is  described  in  [59,  pp.  503-518]. 
In  [57,  p.  724],  Katz  presents  the  definition  of  function  used  by  Johann  Bernoulli, 
an  earlier  and  later  definition  used  by  Euler,  and  definitions  attributed  to  Lacroix, 
Fourier,  Heine,  and  Dedekind.  For  a  complete  and  readable  overview  on  this  topic, 
we  recommend  the  papers  of  Luzin  (both  [64]  and  [65]),  Youschkevitch  [113],  and 
Kleiner  [58].  Kleiner’s  paper  also  has  an  extensive  bibliography. 


Problems 

Problem  14.1.  Complete  the  following:  A  relation  /  :  A  — >  B  is  not  a  function  if  . . . 

Problem  14.2.  Suppose  that  /  :  X  — >  Y .  Recall  that  the  definition  of  ran(/)  was 
stated  in  the  text.  State  carefully  what  it  means  when  we  say  y  £  Y  is  not  in  the 
range  of  /. 

Problem  14.3.  Which  of  the  following  are  functions  from  the  set  A  to  the  set  B! 
Give  reasons  for  your  answers. 

(a)  Define  /  :  R  — >  R  by  the  relation  /  =  {(x,y)  :x2+y2  =  4}  on  R. 

(b)  Define  /  :  M  — >  R  by  f(x)  =  1  /  (x  +  1 ) . 

(c)  Define  /:  R2  — >  R  by  f(x,y)  =  x  +  y. 

(d)  The  domain  of  /  is  the  set  of  all  closed  intervals  of  real  numbers  of  the  form 
[a,b\,  where  a,b  £  R,  a  <  b,  the  codomain  of  /  is  R,  and  /  is  defined  by 
f([a,b])  =a. 

(e)  Define  /  :  N  x  N  — >  R  by  f(n,m)  =  m. 

(f)  Define  /  :  R  — >  R  by 


1  The  translation  is  ours. 
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(g)  Define  /  :  Q  — >  R  by 

(  x+l  if  x  €  2Z 
/(x)  =  <  x  —  1  if  x  e  3Z  . 

[  2  otherwise 

(See  Example  6.1  for  the  definitions  of  2Z  and  3Z.) 

(h)  The  domain  of  /  is  the  set  of  all  circles  in  the  plane  R2,  the  codomain  is  R, 
and  if  c  is  a  circle  in  the  domain,  define  /  by  /(c)  =  the  circumference  of  c. 

(i)  (For  students  with  a  background  in  calculus.)  The  domain  and  codomain  of 
/  are  the  set  of  all  polynomials  with  real  coefficients,  and  /  is  defined  by 
f(p')  =  p1 .  (Here  p1  is  the  derivative  of  p.) 

(j)  (For  students  with  a  background  in  calculus.)  The  domain  of  /  is  the  set  of 
all  polynomials,  the  codomain  is  R,  and  /  is  defined  by  f(p)  =  f0  p{x)dx. 
(Here  /0*  p(x)  dx  is  the  definite  integral  of  p.) 


Problem  14.4.  A  function  /  :  A  — >  R  is  often  called  a  real-valued  function.  Thus, 
authors  will  often  say,  “Let  /  be  a  real-valued  function.”  Sometimes,  A  is  explicitly 
defined.  Other  times,  it  is  understood  that  the  codomain  is  R  and  that  the  domain 
is  the  largest  set  A  for  which  all  values  under  /  result  in  real  numbers.  For  all  real¬ 
valued  functions  below,  specify  the  implied  domain,  assuming  that  ACK. 


(a)  /(x)  =  2±f; 

(b)  g(x)  =  ln(2x2  +  x  —  6) ; 

(c)  h(x)  =  y/Sx—  15  —  x2; 


(d) 


(*— 3)(2x+5) 
x(x+2)(x— 5) ' 


Problem  14.5.  Let  /  :  A^(R)  — >  Z  be  defined  by 


m  = 


min(AnN)  ifAON/0 
-1  ifAON  =  0' 


Prove  that  /  above  is  a  well-defined  function. 

Problem  14.6.  Let  X  be  a  nonempty  set  and  let  A  be  a  subset  of  X.  The  character¬ 
istic  function  or  indicator  function  of  the  set  A  in  A  is 


Xa  :  X  ->  {0, 1}  defined  by  %A  (x) 


1  if  x  e  A 
0  if  x  £  X  \A 


(a)  Since  this  is  called  the  characteristic  function,  it  probably  is  a  function,  but 
check  this  carefully  anyway. 

(b)  Determine  the  domain  and  range  of  this  function.  Make  sure  you  look  at  all 
possibilities  for  A  and  X. 
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Problem  14.7.  Using  the  characteristic  function  from  Problem  14.6  we  can  define 
another  useful  type  of  function: 

Let  I  be  an  interval  of  R  (open,  closed,  half  open,  or  all  of  R).  A  step  function 
is  a  function  /  :  /  — >  R  of  the  form 


/=  Y*akXAk, 
k= 1 

where,  for  k  =  1 , . . .  ,n,  we  have  ak  €  R  and  intervals  A/;  (any  type,  including  I  itself 
or  containing  just  one  point)  such  that  {Ak  :  k  £  { 1 }  forms  a  partition  of  I 
and  has  the  additional  property  that  Aj  D  A(  =  0  if  j  ^  l. 

(a)  Prove  that  a  step  function  as  defined  above  is  a  function. 

(b)  Find  the  range  of  the  step  function. 

(c)  Can  the  range  of  a  step  function  have  infinitely  many  elements?  Explain. 

(d)  Suppose  that  we  have  a  function  g  :  I  — >  R,  where  7  is  an  interval  of  R  and  the 
range  of  g  is  finite.  Is  g  necessarily  a  step  function?  If  it  is,  prove  it;  if  it  isn’t, 
give  a  counterexample. 

Problem  14.8.  For  each  of  the  step  functions  below,  sketch  the  graph  and  identify 
the  range.  (Step  functions  are  introduced  in  Problem  14.7.) 

(a)  /:  [—5,5]  — >  R  defined  by 

/(x)  =  2X[-s-i]  -  3^(_u)  +5^{ij  +X(i,3)  +2X[3,5]; 

(b)  g  :  [-3,4)  ->  R  defined  by  g(x)  =  Ll=-3^kX[k,k+i)- 

Problem  14.9.  Write  each  of  the  following  functions  as  a  sum  of  products  of  charac¬ 
teristic  functions  and  other  well-known  functions.  Say  whether  or  not  your  function 
is  a  step  function  and  explain  your  answer.  (Characteristic  functions  are  introduced 
in  Problem  14.6  and  step  functions  in  Problem  14.7.) 

(  2  —  x  if  x  <  0 

(a)  /  :  R  — >  R  defined  by  /(x)  =  |  2  if0<x<2; 

[  x  else 

(b)  g  :  [0, 12]  — ►  R  given  by  the  graph  of  Figure  14.5  below; 


*4 

i  - 
1/2- 


-o 


t 


5 


o- 


10 


12 


► 

x 


Fig.  14.5 


(c)  h:R—>R  defined  by  h{x) 


sinx  if  0  <  x  <  2k 
0  otherwise 
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Problem  14.10.  Let  X  be  a  bounded  nonempty  subset  of  R.  Suppose  that  we  define 
g  :  ZP(X)  \{0}  — >  R  by  g(S)  =  sup  S.  Is  g  a  well-defined  function?  Why  or  why  not? 

Problem  14.11.  For  reRwe  define  the  greatest  integer  of  x  by  |xj  =  n,  where 
neZ  and  n  <  x  <  n  +  1 .  The  greatest  integer  function  or  floor  function  is  the 

function  /  :  R  — >  Z,  defined  by  f(x)  =  |_xj . 

(a)  Find  |_1/2J,  L^J,  L~ 3-5J,  and  10J . 

(b)  Prove  that  /  is  a  well-defined  function. 

(c)  What  is  the  range  of/?  (Show  all  work!) 

(d)  Sketch  the  graph  of  the  greatest  integer  (floor)  function  if  it  is  restricted  to 

— 5  <  x  <  5. 

Problem  14.12.  See  Problems  14.6,  14.7,  and  14.11  for  the  definitions. 

(a)  Write  the  greatest  integer  function  as  a  sum  of  characteristic  functions  (there 
may  be  more  than  one  way  to  do  this).  Depending  on  your  solution,  the  sum 
will  “appear  to  have  infinitely  many  terms,”  but  to  calculate  a  particular  value 
you  will  be  adding  only  finitely  many  nonzero  terms. 

(b)  Is  the  greatest  integer  function  a  step  function?  Explain  your  answer. 

Problem  14.13.  We  use  the  notation  of  Problem  14. 1 1  and  define  g  :  R  — >  R  by 
g(x)  =  l x\  +  [_— x\ .  Find  ran(g),  and  prove  that  your  answer  is  correct. 

Problem  14.14.  (a)  If  possible,  give  an  example  of  a  function  /  :  N  — »  R,  with 

ran(/)  =  dom(/). 

(b)  If  possible,  give  an  example  of  a  function  /  :  N  — »  R,  with  ran(/)  =  Z+. 

(c)  If  possible,  give  an  example  of  a  function  /  :  N  — »  R,  with  ran(/)  =  Z. 

Problem  14.15.  Let  /  :  R  \  {2}  — >  R  be  defined  by  f(x)  =  l/(x  —  2).  Find  ran(/) 
and  prove  that  your  answer  is  correct. 

Problem  14.16.  Consider  the  (well-defined)  function  /  :  R\  {3/2}  — >  R  defined  by 
f(x)  =  (jc  —  5) / (2jc  —  3).  Carefully  prove  that  ran (/)  =  R\  {l/2}. 

Problem  14.17.  Consider  the  (well-defined)  function  /  :  R\  {3/7}  — >  R  defined  by 
f(x)  =  (x  +  2) / (lx  —  3) .  Find  ran(/)  and  prove  that  your  solution  is  correct. 

Problem  14.18.  (a)  Give  an  example  of  a  function  /  from  R  x  R  to  R+. 

(b)  Give  an  example  of  a  function  /  from  Z  x  Z  to  N  such  that  ran(/)  =  N.  (Prove 
that  /  is  a  function  and  ran(/)  =  N.) 

(c)  Give  an  example  of  a  function  /  from  Z  x  Z  to  N  such  that  ran(/)  ^  N.  (Prove 
that  /  is  a  function  and  ran(/)  ^  N.) 

Problem  14.19.  Let  a,b,c,  and  d  be  real  numbers  with  a  <  b  and  c  <  d.  Let  \a . b] 
and  \c,d\  be  two  closed  intervals.  Find  a  function  /  such  that  /  :  [a,fe]  — >  \c,d\  and 
ran(/)  =  [c,r/].  Prove  everything. 

Problem  14.20.  (a)  Define  /  :  Z  — >  N  by  f(x)  =  |x|.  Is  /  a  function?  If  so,  deter¬ 

mine  ran(/). 
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(b)  Define  /  :  R2  — >  R  by  f(x,y)  =  x.  Is  /  a  function?  If  so,  determine  ran(/). 

Problem  14.21.  Suppose  that  /  is  a  function  from  a  set  A  to  a  set  B.  Thus,  we  know 
that  /  is  a  subset  of  Ax  B.  Is  the  relation  {(y,x)  :  (x,y)  £  /}  necessarily  a  function 
from  B  to  A?  Why  or  why  not?  (Say  as  much  as  is  possible  to  say  with  the  given 
information.) 

Problem  14.22.  Which  of  the  following  functions  equal  /  :  Z  — ►  Z  defined  by 
f{x)  =  |x|?  Prove  your  answers  (make  sure  you  show  that  the  functions  below  either 
equal  /  or  do  not  equal  /). 


(a)  The  function  g  :  Z  — >  Z  defined  by  g(x)  =  ^ ^ 

(b)  The  function  h  :  Z  — >  Z  defined  by  h{x)  =  Vx2. 

(c)  The  function  k  :  Z  — >  Z  defined  by  k(x)  =  |  *  ^ 


x2/|x|  ifx  ^  0 

0  if  x  =  0  ‘ 


Problem  14.23.  Let  X  be  a  nonempty  set.  Find  all  relations  on  X  that  are  both  equiv¬ 
alence  relations  on  X  and  functions  from  X  to  X. 

Problem  14.24.  We  can  now  define  the  indexing  process  more  rigorously  than  we 
were  able  to  in  Chapter  8.  Let  I  and  X  be  sets  and  /  :  I  — >  X  be  a  function.  Then 
we  call  the  domain  I  an  index  set  and  an  element  of  I  an  index.  The  range  of  the 
function,  denoted  here  {/(/)  :  j  £  /},  is  called  an  indexed  set.  If  the  set  X  C  .3^ ( Y ) 
for  some  set  Y  (and  thus  f  :  I  —>  £P(Y)),  then  {/{])  :  j  £  1}  is  usually  called  an 
indexed  collection  of  sets. 

As  a  specific  example,  consider 


/  :  Z+  — >  ^(R)  defined  by  f{n)  =  {x  £M. :  n —  2n  <x  <  7t +  2/ n}. 

(a)  Find  UnGz+/(«)- 

(b)  Find  f)nez+  f(n)- 


Chapter  15 

Functions,  One-to-One,  and  Onto 


Functions  can  map  elements  from  the  domain  to  the  codomain  in  many  ways.  A 
function  may  “hit”  every  element  in  the  codomain,  or  it  may  “miss”  some.  It  may 
assign  more  than  one  x  to  a  y  or  it  may  assign  exactly  one  x  to  each  y.  We  will 
understand  our  function  better  if  we  know  which  of  these  things  it  does.  Precise 
formulations  of  these  ideas  will  be  given  in  a  moment.  It’s  a  mouthful,  though,  and 
really  requires  practice. 

To  say  that  a  function  f  :  A  —>  B  is  one-to-one  means  that  for  all  a\  .(12  £  A, 
if  f{a\)  =  f(a.2 ),  then  a\  =  ai-  A  function  /  :  A  — >  B  is  onto  if  ran(/)  =  B.  If  a 
function  has  this  property,  then  we  say  that  /  maps  A  onto  B.  Some  authors  use  the 
word  injective  rather  than  one-to-one  and  surjective  rather  than  onto.  If  a  function 
is  both  one-to-one  and  onto  (or  injective  and  surjective),  then  we  say  the  function  is 
bijective. 

Diagrams  using  small  sets  may  help  illustrate  the  ideas  involved  in  these  def¬ 
initions.  We  sketch  two  functions  that  are  not  one-to-one  in  Figure  15.1  and  two 
functions  that  are  not  onto  in  Figure  15.2.  Can  you  make  a  diagram  for  a  function 
that  is  bijective? 

Before  moving  on,  let’s  think  about  these  definitions.  The  definition  of  one-to- 
one  is  an  implication  with  quantifiers  on  elements  of  the  domain  A.  It  moves  “for¬ 
ward,”  in  the  sense  that  we  start  with  elements  in  A  and  see  where  /  maps  them.  The 
definition  of  onto,  on  the  other  hand,  requires  that  we  show  something  about  every 
element  in  the  set  B.  It  will  require  us  to  move  “backward,”  in  the  sense  that  we  will 
start  with  something  in  B  and  see  what  element  of  A  is  mapped  to  it  under  /. 

If  you  want  to  show  that  a  function  is  onto,  we  said  that  you  must  check  that 
ran(/)  =  B.  Technically  that  would  mean  showing  ran (/)  C  B  and  B  C  ran (/).  But 
if  we  are  showing  that  a  function  f :  A  — >  B  is  onto,  we  already  know  that  ran(/)  C  B\ 
that  is,  we  get  that  half  for  free.  So  if  we  know  we  have  a  function  / :  A  — >  B  (maybe 
it  was  given  to  us;  maybe  we  showed  it),  to  show  that  /  maps  A  onto  B,  we  only 
have  to  check  that  B  C  ran(/).  And  what  does  this  mean?  The  answer  is  given  in  the 
lemma  below. 

Lemma  15.1.  Let  f  :  A  — >  B  be  a  function.  Then  f  maps  A  onto  B  if  and  only  if  for 
all  b  £  B  there  exists  (at  least  one)  a  £A  such  that  f{a)  =  b. 
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Fig.  15.1  The  function  /  does  not  map  onto  B,  but  the  function  g  does.  Neither  function  is  one-to- 
one 


Since  this  lemma  is  just  a  reformulation  of  the  definition  of  onto,  we  will  often 
use  it  without  explicitly  referencing  it. 

Let’s  run  through  some  of  these  ideas  in  slow  motion.  We’ll  begin  with  a  simple 
example,  and  move  on  to  a  more  challenging  one. 

Example  15.2.  Let  /  :  R  — >  R  be  defined  by  /(x)  =  2x  +  5.  Then  /  is  one-to-one. 

Proving  a  function  is  one-to-one  is  often  easier  than  proving  it  is  onto,  because 
one-to-one  doesn’t  have  that  backwards  quality.  To  prove  it,  we  assume  that  (for 
two  arbitrary  points  x\  and  xj  in  the  domain)  /(x i)  =  fixf),  and  show  that  x\  =  X2- 
So  now  we’ll  just  dive  in  here.  Don’t  forget  that  the  definition  of  one-to-one  is  an 
implication  and  therefore  we  expect  to  use  our  assumption. 

Proof.  Let  xi  ,X2  £  R.  If  /(x r)  =  /(x 2),  then  2xi  +5  =  2x2  +  5.  Simplification  yields 
xj  =  X2,  as  desired.  Therefore  /  is  one-to-one.  □ 

We  turn  to  a  more  interesting  example.  Note  that  even  though  the  functions  in 
these  two  examples  are  quite  different,  the  proofs  that  they  are  one-to-one  are  quite 
similar. 

Example  15.3.  In  Example  14.8,  we  considered  /  :  R  \  {3}  — >  R  defined  by  /(x) 
=  (x  +  1 ) / (x  —  3).  We  started  with  an  element  y  and  solved  for  x.  We  found  an  x 
that  corresponded  to  y,  but  there  may  be  others.  Is  the  function  one-to-one  (meaning 
there  aren’t  any  others)? 

We  claim  that  /:IR\{3}— >R.  defined  by  /(x)  =  (x+  1 )  /  (x  —  3)  is  one-to-one. 
Proof.  Letxi,X2  £  R\  {3}.  If  /(x  1)  =  /(x 2),  then 

xi  +  1  _  X2  +  1 

xi  —  3  X2  —  3 
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Multiplying  through  yields  X1X2  +X2  —  3x\  —  3  =  X2X\  +xi  —  3x2  —  3.  Canceling, 
combining,  and  dividing  by  4,  we  find  that  x\  =  xo,  as  desired.  Therefore  /  is  one- 


□ 


to-one. 


Note  that  /:R\{3}^Ris  not  onto.  This  follows  from  our  work  in  Exam¬ 
ple  14.8,  where  we  showed  that  ran (/)  =  R  \  {1}  ^  R.  The  work  of  that  example 
together  with  the  proof  above  show  that  /:R\{3}— >M\{l}isa  bijection.  O 

Exercise  15.4.  What  do  you  need  to  do  in  order  to  show  that  a  function  is  not  one- 
to-one?  Use  what  you  just  decided  to  show  that  the  function  /  :  R  — >  R  defined  by 
f(x)  =  x2  is  not  one-to-one.  O 

Exercise  15.5.  In  Exercise  14.9,  which  of  the  functions  are  one-to-one?  If  they  are 
not  one-to-one,  show  that  carefully  as  well.  O 

It  is  now  time  to  investigate  what  it  really  means  when  we  say  that  a  function 
maps  a  set  A  onto  a  set  B. 

Example  15.6.  Prove  that  the  function  /  :  R  — >  R  defined  in  Example  15.2  by 
f{x)  =  2x  +  5  is  onto. 

We  first  devise  our  plan. 

“Devising  a  plan.”  We  have  checked  that  /  :  R  — >  R  is  well-defined,  so  (in  view 
of  Lemma  15. 1)  we  let  y  £  R.  We  must  show  that  y  =  f(x)  for  some  x  £  R.  Thus,  we 
must  show  that  y  =  2x  +  5  for  some  x  £  R.  It  is  now  easy  to  see  that  x  =  (y  —  5) /2 
will  work.  Remember,  when  we  write  this  up,  we  will  act  as  though  the  reader  has 
not  seen  this  work. 

Proof.  Let  y  £  R  and  let  x  =  (y  —  5) /2.  Then  x  £  R  =  dom(/)  and 


Since  /  :  R  — >  R  is  a  well-defined  function,  /  is  onto  (by  Lemma  15.1). 


□ 


Functions  that  are  defined  in  cases  will  play  an  important  role  in  the  rest  of  this 
course.  They  are  also  illuminating  examples,  because  they  show  just  how  much  one- 
to-one  depends  on  “what  goes  into  /”  and  just  how  much  onto  depends  on  “what 
comes  out.”  Showing  that  they  are  one-to-one  and  onto  is  a  bit  tricky.  We’ll  go 
through  one  example  carefully  first. 

Example  15.7.  We  will  show  that  /  :  Z  — »  N  as  defined  below  and  graphed  in  Fig¬ 
ure  15.3  is  a  bijective  function: 


2  n  if  n  >  0 

—2  n  —  1  if  n  <  0 
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Fig.  15.3 


We  should  mention  something  before  we  begin.  We,  the  authors  of  this  text, 
checked  that  the  /  in  this  example  is  really  a  function  from  Z  to  N.  But  don’t  trust 
us;  check  it  again.  Remember  to  check  that  /  assigns  a  value  in  N  to  each  element 
of  Z,  and  that  /  assigns  at  most  one  value  in  N  to  each  integer.  We  now  begin  our 
example. 

“Devising  a  plan.”  Now  to  prove  that  /  is  one-to-one,  we  must  show  that  if  m 
and  n  are  integers,  and  /( m)  =  /(«),  then  m  =  n.  But  it’s  sort  of  confusing  here, 
since  we  have  more  than  one  choice  for  f(m)  and  /(«).  The  problem  seems  to  be 
that  the  function  is  defined  in  cases,  and  it’s  not  immediately  clear  which  case  to 
use.  The  choice  depends  on  whether  m  and  n  are  negative  or  nonnegative.  So  we’ll 
break  this  into  all  possible  cases.  Let’s  see.  Both  could  be  nonnegative  (case  1),  both 
negative  (case  2),  or  one  nonnegative,  one  negative  (case  3).  So  let’s  check  them  all. 

Proof.  [Proof  that  /  is  one-to-one]  Let  m,  n  £  Z  and  suppose  that  f(m)  =  f(n). 

Case  1.  Suppose  that  m  >  0  and  n  >  0.  Then  f(m)  =  2m  and  f(n)  =  2 n.  Thus 
2m  =  2 n,  and  therefore  m  =  n. 

Case  2.  Suppose  that  m  <  0  and  n  <  0.  Then  f{m)  =  —2 m  —  1  and  f{n)  =  —2 n  —  1 . 

Thus,  —  2m  —  1  =  2n  —  1,  and  therefore  m  =  n. 

Case  3.  Suppose  that  one  of  the  two,  say  m,  is  nonnegative,  and  the  other  is  nega¬ 
tive.  Then  f[m )  =  2m  and  /(«)  =  —2 n  —  1.  Thus  2m  =  —2 n  —  1.  But  this  means 
that  an  even  number,  2m,  is  equal  to  an  odd  number,  2n  —  1,  which  is  impossi¬ 
ble. 

Therefore,  if  f(m)  =  f(n),  only  cases  1  and  2  can  occur.  In  either  of  these  cases,  we 
have  shown  that  m  =  n.  Thus  /  is  one-to-one.  □ 

Now  we  will  show  that  /  maps  Z  onto  N. 

“Devising  a  plan.”  Recall  that  to  show  /  is  onto,  we  must  show  that  ran(/)  =  N. 
We  (you,  actually)  already  checked  that  /  is  a  well-defined  function  from  Z  to  N,  so 
ran(/)  C  N.  So  let  k  £  N.  We  wish  to  find  m  such  that  f(m)  =  k  and  m  €  Z.  Again, 
because  there  are  cases,  this  is  a  bit  confusing.  So  let’s  try  to  think  about  this  before 
we  really  begin.  We’ll  try  to  do  this  for  a  few  specific  points.  Though  this  won’t 
prove  anything,  it  may  tell  us  how  to  begin  our  proof.  So  moving  along  in  the  spirit 
of  showing  something  is  onto,  if  f(m)  =  4,  what’s  ml  (It’s  2.)  If  f(m)  =  3,  what’s 
ml  (It’s  —2.)  Now  maybe  you  see  it.  If  you  don’t,  keep  trying  until  you  do.  After  a 
while,  you  should  see  that  our  choice  of  m  depends  on  whether  k  is  even  or  odd. 
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Proof.  [Proof  that  /  maps  Z  onto  N]  Let  k  £  N.  If  k  is  even,  then  k  =  2m  for  some 
m  £  Z  with  m  >  0.  Thus,  m  £  Z  and  f(m)  =  2m  =  k.  If  k  is  odd,  then  k+  1  is  even. 
Hence  m=  (k  +  1)/ (—2)  £  Z.  Since  k  >  1,  we  have  m  <  0.  Thus,  f(m )  =  — 2m —  1 
=  —  2((k+ 1)/ (—2))  —  1  =  k.  We  conclude  that  for  all  k  £  N,  there  exists  m  £  Z  such 
that  f(m)  =  k.  Since  /  :  Z  — >  N  is  a  well-defined  function,  /  maps  Z  onto  N.  □ 

Note  that  though  the  functions  in  the  last  two  examples  are  quite  different,  the 
proofs  that  they  are  onto  are  quite  similar. 

What  about  an  example  of  a  function  f  :  A— >  B  that  does  not  map  A  onto  B2  Our 
Lemma  15.1  is  quite  handy  here:  Let  /  :  A  — >  B  be  a  function.  Then  /  is  not  onto  if 
there  exists  b  £  B  such  that  f{a)  ^  b  for  every  a  £A. 

For  example,  consider  the  function  / :  R  — >  R  defined  by  f{x)  =  x2.  Then  to  show 
that  /  does  not  map  onto  R,  we  note  that  —  1  £  R,  but  there  exists  no  a  £  R  such 
that  f(a)  =  —  1 .  You  should  be  able  to  show  that  /  does  map  R  onto  the  nonnegative 
real  numbers. 

Exercise  15.8.  In  Exercise  14.9,  which  of  the  functions  map  onto  their  codomain? 
If  they  do  not  map  onto,  show  that  carefully  as  well.  O 

We  now  turn  to  a  concrete  example  of  a  function  /  and  a  special  way  to  construct 
a  function  (called  the  restriction)  that  is  related  to  /  in  a  very  special  way. 

Suppose  that  the  set  A  contains  all  the  buses  in  the  city  of  Bern,  Switzerland,  and 
the  function  f  :  A  —>  Z+  enumerates  them  by  assigning  a  positive  integer  to  each 
one.  We  will  let  C  denote  the  subset  of  A  that  contains  all  the  buses  that  run  on 
biogas.  There  are  now  enough  biogas  buses  available  to  serve  the  city  on  a  regular 
day  and,  therefore,  we  are  interested  in  enumerating  just  the  biogas  buses.  If  we 
wish  to  select  only  the  biogas  buses,  we  would  be  interested  in  the  enumeration  of 
the  set  C;  that  is,  we  use  the  old  function  but  consider  only  buses  in  the  set  C.  This 
is  easily  done,  by  restricting  the  function  /  from  A  to  C.  Formally  speaking,  the 
restriction  is  the  following:  Let  /  :  A  —  B  be  a  function  and  CCA.  The  restriction 
of  /  to  C  is  the  function  f\c  :  C  — >  B  defined  by  / 1 c  (x)  =  f(x)  for  all  x  £  C. 

Exercise  15.9.  Let  /  :  R  — >  R  be  a  function. 

(a)  Give  an  example  of  a  function  /  and  a  nonempty  subset  A  of  R  such  that  /  is 
not  one-to-one,  but  /|a  is  one-to-one. 

(b)  Give  an  example  of  a  function  /  and  a  nonempty  subset  A  of  R  such  that 
/\a  :  A  — >  R  is  not  onto,  but  /  is  onto. 

(c)  Are  there  examples  of  /  and  ACR  such  that  /  is  a  bijection,  but  f\  a  :  A  — >  R 
is  not?  Give  an  example  or  explain  why  none  exists. 

(d)  Are  there  examples  of  /  and  ACR  such  that  /\a  :  A  — >  R  is  a  bijection,  but 
/  is  not?  Give  an  example  or  explain  why  none  exists. 
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Definitions 

Definition  15.1.  A  function  / :  A  — > B  is  one-to-one  or  injective  if  for  all  a\  .ai  £  A, 

whenever  f(a\ )  =  /(a2),  then  a\  =  ci2- 

Definition  15.2.  A  function  /  :  A  — >  B  is  onto  or  surjective  if  ran(/)  =  B. 

Definition  15.3.  A  function  / :  A  —  B  is  bijective  if  it  is  injective  and  surjective  (or, 
equivalently,  one-to-one  and  onto). 

Definition  15.4.  Let  /  :  A  — >  B  be  a  function  and  CCA.  The  restriction  of  /  to  C 
is  the  function  f\c  :  C  — >  B  defined  by  / 1 c  {x)  =  f(x)  for  all  x  £  C. 


Solutions  to  Exercises 

Solution  (15.4).  Carefully  negating  the  definition,  we  see  that  to  show  a  function 
/  :  A  — >  B  is  not  one-to-one,  we  must  show  that  there  exist  x\  and  xi  in  A  with 
f(x i)  =  f(x 2)  and  xi  ^  X2.  For  the  particular  function  /(x)  =  x2  given  above,  we 
note  that  the  numbers  1  and  —1  are  both  real  numbers  with  /( 1 )  =  /(— 1),  but 
1^—1.  Therefore  /  is  not  one-to-one. 

Solution  (15.5).  Only  the  function  defined  in  (a)  is  one-to-one:  If xi,x'2  £l\ {0} 
and  /(x  1)  =  /(x 2),  then  1  /x\  =  1  /x2-  Therefore,  xi  =  X2,  showing  that  /  is  one-to- 
one. 

For  part  (b)  consider  (1, 1),  (2,2)  £  Z  x  (Z\  {0})  =  dom(/).  Then  (1, 1)  ^  (2,2), 
but  /( 1, 1)  =  1  =  /( 2,2).  This  shows  that  /  is  not  one-to-one. 

For  part  (c),  let  —  1,  —  3  £  R  =  dom(/).  Then  /(— 1)  =  2  =  /(— 3)  and  hence  / 
is  not  one-to-one. 

Solution  (15.8).  None  of  the  functions  maps  onto  their  codomain.  The  range  of 
each  function  was  presented  in  the  solution  to  Exercise  14.9. 

Solution  (15.9).  We  will  need  only  two  functions  to  solve  all  four  parts  of  this 
problem.  Of  course,  many  other  examples  are  possible. 

(a)  Let  / :  R  — >  R  be  defined  by 


and  let  A  =  K\  (0, 1]  C  R.  Clearly  /  is  not  one-to-one,  but  /|jj\(o  1  is. 

(b)  Let  /  :  R  — >  R  be  defined  by  /(x)  =  x  and  let  A  =  Z.  Then  /  is  onto  but 


/ |z  :  Z  — >  R  is  not. 


(c)  The  example  presented  in  part  (b)  works  here  also. 

(d)  The  example  presented  in  part  (a)  works  here  also. 
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Problems 

Problem  15.1.  For  each  of  the  following,  you  are  asked  to  give  an  example  of  a 
function.  (You  should  always  state  the  domain,  codomain,  and  the  associated  rule  of 
your  function.) 

(a)  Give  an  example  of  a  function  that  is  both  one-to-one  and  onto. 

(b)  Give  an  example  of  a  function  that  is  one-to-one,  but  not  onto. 

(c)  Give  an  example  of  a  function  that  is  not  one-to-one,  but  is  onto. 

(d)  Give  an  example  of  a  function  that  is  neither  one-to-one  nor  onto. 

Problem  15.2.  (a)  Let  /  :  R  — >  R  be  defined  by  f(x)  =  x2.  Show  that  /  is  not 


onto. 

(b)  Show  that  /,  as  defined  above,  maps  R  onto  {x  £  R  :  x  >  0}. 

(c)  Consider  the  function  g  :  Z  — >  N  defined  by  g(x)  =  x2.  Is  g  onto? 


(d)  Both  g  and  /  take  elements  of  the  domain  and  square  them.  Why  did  we  use 
the  letter  g  in  the  previous  part  of  this  problem,  rather  than  the  letter  /? 

Problem  15.3.  Is  the  absolute  value  function  /  :  R  — >  R  defined  by  f(x)  =  |x|  one- 
to-one?  Why  or  why  not? 

Problem  15.4.  (a)  Is  there  a  one-to-one  function  from  the  set  {1,2,3}  to  the  set 

{1,3}?  Why  or  why  not? 

(b)  Is  there  a  function  mapping  {1,2,3}  onto  the  set{l,3}?  Why  or  why  not? 

(c)  Is  there  a  one-to-one  function  mapping  the  open  interval  (0,2)  to  the  open 
interval  (0,1)? 

(d)  Is  there  a  one-to-one  function  mapping  the  set  {x  G  R  :  x  >  0}  to  the  open 
interval  (0,1)? 

(e)  Is  there  a  function  mapping  the  set  {x  G  R  :  x  >  0}  onto  the  open  interval 


(0,1)? 


Problem  15.5.  Recall  Definition  15.1  of  one-to-one. 

(a)  State  the  contrapositive  of  the  definition. 

(b)  By  negating  the  contrapositive,  complete  the  following  definition.  A  function 

/  :  X  — >  Y  is  not  one-to-one  if _ 

Problem  15.6.  Criticize  the  following  “definition”  of  onto.  “A  function  /  :  X  — *  Y 
is  onto  if  there  exists  an  x  £  X  such  that  for  each  y  £  Y  we  have  f(x)  =  y.” 

Problem  15.7.  Define  /  :  R  — >  R  by  f(x)  =  5  +  {x—  3)2. 

(a)  Prove  that  /  is  not  injective. 

(b)  Find  ran(/)  and  prove  that  your  conjecture  is  correct. 

Problem  15.8.  We  define  /  :  R  — >  R  by 
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This  function  is  well-defined  and  you  may  assume  this  without  proving  it.  Prove 
that  this  function  is  bijective. 

Problem  15.9.  We  define  / :  R  — *  R  by 


2x  +  4  if  x  <  —2 

if  2  <  x  <  2  . 
- x  if  x  >  2 


This  function  is  well-defined  and  you  may  assume  this  without  proving  it.  Prove 
that  this  function  is  bijective. 

Problem  15.10.  For  each  of  the  functions  below,  determine  whether  or  not  the  func¬ 
tion  is  one-to-one  and  whether  or  not  the  function  is  onto.  If  the  function  is  not 
one-to-one,  give  an  explicit  example  to  show  what  goes  wrong.  If  it  is  not  onto, 
determine  the  range. 

(a)  Define  /  :  R  — >  R  by  /(x)  =  1  /  (x2  +  1 ) . 

(b)  Define  /  :  R  — >  R  by  /(x)  =  sin(x).  (Assume  familiar  facts  about  the  sine 
function.) 

(c)  Define  /  :  Z  x  Z  — >  Z  by  f(n,m)  =  nm. 

(d)  Define  /  :  R2  x  R2  — >  R  by  /( (x,  y) ,  (u ,  v) )  =  xu  +  yv.  (Do  you  recognize  this 
function?) 

(e)  Define  /  :  R2  x  R2  — >  R  by  /((x,y),  (h,  v))  =  y/(x  —  u)2  +  (y  —  v)2.  (Do  you 
recognize  this  function?) 

(f)  Let  A  and  B  be  nonempty  and  b  £  B.  Define  /  :  A  — >  A  x  B  by  f(a)  =  ( a,b ). 

(g)  Let  X  be  a  nonempty  set.  Define  /  :  £P(X)  — >  &(X)  by  /(A)  =  X  \  A. 

(h)  Let  B  be  a  fixed  proper  subset  of  a  nonempty  set  X.  We  define  a  function 
/  :  &(X)  -»•  &>(X)  by  /(A)  =  AflB. 

(i)  Let  / :  R  — >  R  be  defined  by 


2  —x  if  x  <  1 


1/x  otherwise 


Problem  15.11.  Definition  14.4  introduced  the  characteristic  function  %,\  of  a  subset 
A  of  a  setX. 

(a)  Under  what  conditions  on  the  sets  A  and  X  is  the  function  %a  ■  X  —>{0,1} 
onto?  Justify  your  answer. 

(b)  Under  what  conditions  on  the  sets  A  and  X  is  the  function  %a  ■  X  —>{0,1} 
one-to-one?  Justify  your  answer. 

(c)  Under  what  conditions  on  the  sets  A  and  X  is  the  function  Xa  '■  X  — >  {0, 1} 
bijective?  Justify  your  answer. 

Problem  15.12.  For  each  of  the  following,  determine  whether  or  not  /  :  A  — >  B  is 
a  function  from  the  set  A  to  the  set  B.  If  it  is,  prove  that  /  is  one-to-one,  or  give 
an  example  to  show  that  /  is  not  one-to-one.  Then  prove  that  /  is  onto,  or  give  an 
example  of  an  element  in  the  codomain  that  is  not  in  the  range  to  show  that  /  is  not 
onto. 
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(a)  Define  /  :  ZxZ— s-ZxZby  f(x,y)  =  (y,x). 

(b)  Define  /  :  Z  x  Z  — >  Z  by  f(x,y)  =  x2  +  y2. 

(c)  Let  y  £  R.  Define  /  :  R  — >  R  by  /(x)  =  y  ■  x.  (Does  your  answer  depend  on 

y?) 

(d)  Define  /  :  ^(Z)  — >  Z  by  /(5)  =  maxS. 

Problem*  15.13.  Let  /  :  R  — >  (— 1, 1)  be  defined  by 


Prove  that  /  is  a  bijective  function,  mapping  M  onto  the  open  interval  (—1,1). 

Problem  15.14.  Let  a,b,c,  and  d  be  real  numbers  with  a  <  b  and  c  <  d.  Define  a 
bijection  from  the  closed  interval  [a.b\  onto  the  closed  interval  \c.d\  and  prove  that 
your  function  is  a  bijection. 

Problem  15.15.  Let  F([0, 1])  denote  the  set  of  all  real-valued  functions  defined  on 
the  closed  interval  [0, 1],  Define  a  new  function  0  :  F([0, 1])  — >  R  by  0(/)  =  /( 0). 
Is  0  a  function  from  F([0, 1])  to  R?  Is  it  one-to-one?  Is  it  onto?  Remember  to  prove 
all  claims,  and  to  provide  examples  where  appropriate. 

Problem  15.16.  Find  a  function  /  :  R  — >  R+  that  is  one-to-one. 

Problem  15.17.  Let  /  be  a  function,  / :  R  — >  R.  Define  a  new  function  /  •  / :  R  — >  R 
by  (/  ■  /)  (x)  =  /(x)  -  fix).  Prove  that  f  ■  f  is  a  function.  Then  do  the  remaining  parts 
of  the  problem.  (You  may  wish  to  work  Problem  15.16,  if  you  haven’t  already  done 
so.) 

(a)  Does  there  exist  a  function  /  for  which  /  •  /  is  one-to-one?  If  not,  why  not? 
If  there  is,  what  is  an  example? 

(b)  Does  there  exist  a  function  /  for  which  /  •  /  maps  onto  R?  If  not,  what  is 
ran (/  •  /)?  Your  answer  will  be  in  terms  of  ran(/). 

Problem  15.18.  Let  f  :  A  ^  B  be  a  function  and  CCA.  Prove  that  f\c  is  a  well- 
defined  function. 

Problem*  15.19.  Let  /  :  A  — >  B  be  a  function  and  CCA. 

(a)  Prove  that  if  /  is  one-to-one,  then  f\c  is  one-to-one. 

(b)  Prove  that  if  f\c  is  onto,  then  /  is  onto. 

Problem  15.20.  Let  /  :  X  — >  Y  and  g  :  X  — >  Y  be  functions  and  let  ,<//  denote  a 
partition  of  X.  Show  that  if  / =  g\^  for  all  A  €  srf,  then  /  =  g. 

Problem  15.21.  Let  /  :  R  — >  R  be  /(x)  =  x3  —  3x  and  A  =  ^  —  oo,  —  \/3)  U  [x/3,  °°)-  Is 
/  a  bijection?  Is  f\&  :3-»Ra  bijection?  (Your  solution  may  involve  the  intermedi¬ 
ate  value  theorem.) 


166 


15  Functions,  One-to-One,  and  Onto 


Problem  15.22.  Show  that  the  function  /  :  R  — >  R  defined  by  f(x)  =  |x|  +  2x  is 
bijective.  You  might  find  it  more  convenient  to  write  /  as  a  function  defined  in 
cases. 


Problem*  15.23.  Define  / :  (— 1, 1)  — >  R  by 


/0)  = 


(a)  Show  that  /  is  injective. 

(b)  Show  that  /  is  surjective. 


Problem  15.24.  Let  a,b,c,  and  d  be  real  numbers,  not  both  of  c  and  d  zero.  Further, 
let  X  =  {x  G  R  :  cx+  d  ^  0}.  The  function  /  :  X  — >  R  defined  by 


/(*) 


ax  +  b 
cx  +  d ’ 


is  well-defined  (you  need  not  prove  that). 

(a)  Under  what  conditions  (i.e.,  restrictions  on  a,b,c,d)  will  /  be  one-to-one? 
Prove  your  conjecture. 

(b)  Assume  that  the  conditions  that  you  proposed  in  part  (a)  hold  and  that,  in 
addition,  c  =  0.  Show  that  in  this  case,  /  is  a  bijection  from  R  to  R. 


Problem  15.25.  Let  ~  be  an  equivalence  relation  on  a  nonempty  set  X.  We  denote 
the  set  of  all  equivalence  classes  of  ~  by  ^ .  Define  /  :  X  — ►  ^  by  f(x)  =  Ex.  Thus, 
an  element xSlis  mapped  to  the  equivalence  class  containing x. 

(a)  Prove  that  /  is  a  well-defined  function. 

(b)  Prove  that  /  is  suijective. 

(c)  Give  necessary  and  sufficient  conditions  on  the  equivalence  relation  for  /  to 
be  bijective. 


Problem  15.26.  Let  /  :  A  — ^  B  be  a  surjective  function.  For  x,y  £  A,  we  say  x  ~  y  if 
and  only  if  f{x)  =  f(y). 

(a)  Prove  that  ~  is  an  equivalence  relation  on  A. 

We  denote  the  collection  of  all  equivalence  classes  of  ~  by  Yi  Define  g  :  — >  B  by 

g{Ex)  =  f(x). 

(b)  Prove  that  g  is  a  well-defined  function. 

(c)  Prove  that  g  is  bijective. 


Chapter  16 

Inverses 


Given  functions  /  :  A  — >  B  and  g  :  C  — >  D  with  ran(/)  C  C,  we  can  define  a  third 
function  called  the  composite  function  from  A  to  D.  (We  will  usually  call  this  the 
composition,  rather  than  the  composite  function.)  This  composition  is  the  function 
go  f  :  A  — *•  D  defined  by  (gof)(x)  =  g(f(x)).  So,  for  example,  if  /  :  R  — >  R  and 
g  :  R  — »  R  are  defined  by  f(x)  =  x2  and  g(x)  =  sin(x),  then  (go  f)(x)  =  g(f(x )) 
=  g(x2)  =  sinfx2).  Note  that  the  order  really  matters  here.  Using  /  and  g  as  above, 
for  example,  (fog)(x)  =  f(g(x))  =  /(sin(x))  =  (sin(x))2.  You  can  check  pretty  eas¬ 
ily  that  these  two  functions  are  different.  (Check  this  pretty  easily.)  So  composition 
of  functions  is  not  commutative. 

Consider  the  two  functions  in  Figure  16.1.  Here  /  :  A  — >  B,  where  A  =  {a.b.c] 
and  B  =  {1,2, 3, 4},  while  g  :C  — >  D,  where  C  =  {2, 3, 4, 5, 6}  and  D  =  {a, /3 ,  y}. 
Then  ran(/)  C  C,  so  the  composition  go  f  is  defined.  To  determine  the  action  of 
go  f  algebraically,  use  the  definition  of  each.  For  example,  (go  f)(a)  =  g(f(a)) 
=  g( 2)  =  j3.  To  determine  the  action  visually,  follow  the  arrows,  remembering  that 
/  goes  first. 

Take  this  opportunity  to  check  that  the  composition  of  three  functions  satisfies 
the  associative  property.  In  other  words,  if  we  have  three  functions  f,g,  and  h  so 
that  the  composition  makes  sense  (what  would  that  mean?),  then 

ho(gof )  =  (hog)of. 

We’ll  use  this  result  in  this  chapter. 

Sometimes  it  is  useful  to  “undo”  the  action  of  a  function  /.  If  /  maps  3  to  5, 
we  might  wish  to  “undo”  that  by  finding  a  function  that  takes  5  back  to  3.  This  is 
most  useful  when  we  can  undo  the  action  of  /  on  the  whole  range,  not  at  just  one 
point,  because  then  every  element  ends  up  back  where  it  started.  For  example,  if  / 
cubes  all  the  values  in  its  domain,  we  can  “reverse”  that  action  by  taking  the  cube 
root.  Mathematically  what  this  means  is  that  if  /  :  R  — >  R  is  defined  by  f(x)  =  x3, 
then  g  :  R  — »  R  defined  by  g(x)  =  x1/3  satisfies  two  things:  (go  f)(x)  =  x  for  all 
x  £  dom(/) ,  and  (fog)(y)  =  y  for  all  y  £  dom(g) . 
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But  what  happens  if  /  :  M  — >  R  is  defined  by  f(x)  =  x2?  If  we  want  g  to  “undo” 
this  action,  then  we  want  g  to  satisfy  (g  o  /)  (x)  =  x  for  all  x  £  R.  But  if  x  =  2,  we 
need  (g  o  /)  (2)  =  g( 4)  =2  and,  if  x  =  —2,  we  need  ( g  o  f)  (—2)  =  g  (4)  =  —2  (see 
Figure  16.2).  What’s  the  problem  here?  Well,  g  is  not  allowed  to  assign  two  different 
values  to  the  number  4.  So  we  can’t  do  this  for  all  functions.  When  can  we  do  it? 
(Think  first,  read  on  later.) 

Suppose  that  a  function  is  bijective.  Then,  rather  than  looking  in  the  domain  and 
asking  what  x  gets  mapped  to,  we  can  look  in  the  range  at  y  and  ask  where  it  came 
from.  Since  the  function  is  onto,  y  came  from  some  x.  Since  the  function  is  one-to- 
one,  y  came  from  exactly  one  x.  So  we  can  define  an  inverse  function  as  follows. 

Let  f :  A  — >  B  be  a  bijective  function.  The  inverse  off  is  the  function  /  1  :  B  —  A 
defined  by 


f  1  (y)  =  x  if  and  only  if  f{x)  =  y. 

Whenever  we  define  a  function,  we  have  to  ask  ourself:  “Is  it  well-defined?”  Is  it? 
The  domain  is  defined  to  be  B.  By  definition,  the  value  of  an  element  of  B  under  /  1 
is  some  x  £  A.  Hence  A  qualifies  as  a  codomain  of  /  1 .  Now  we  check  condition 
(i)  of  the  definition  of  a  function.  Let  b  £  B.  Since  /  is  onto,  there  exists  an  element 
a  £  A  such  that  f[a)  =  b.  Hence  /  1  (b)  =  a  is  defined  and  property  (i)  holds.  For 
property  (ii)  we  assume  that  there  is  an  element  b  £  B,  and  elements  a  and  c  in  A 
such  that  f~l(b)  =  a  and  =  c.  By  the  definition  of  we  have  f{a)  =  b 


y 


Fig.  16.2  f(x)  x2 
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Fig.  16.3  f(x)  =  x3  —  5 

and  /(c)  =  b.  Hence  f{a)  =  f(c)  and  since  /  is  one-to-one,  a  —  c.  This  shows  that 
property  (ii)  holds  and  we  conclude  that  /  1  is  well-defined.  Note  that  this  function 
is  only  defined  in  the  case  when  /  is  bijective. 

The  discussion  in  the  last  paragraph  shows  that  /  is  indeed  a  function.  Thus 
we  have  shown  that  if  /  :  A  — >  B  is  a  bijective  function,  then  there  exists  an  inverse 
function  g:B^>A.  The  remainder  of  this  chapter  will  be  spent  understanding  inverse 
functions.  In  particular,  we  will  show  that  an  inverse  function  is  unique  and  we  will 
speak  of  “the”  inverse  of  f. 

Example  16.1.  We  define  /  :  K.  — >  R.  by  fix)  =  x3  —  5.  Graph  the  function  /.  Then 
prove  that  /  is  one-to-one  and  onto.  Once  you  have  done  that,  decide  what  /  1  is. 

“Devising  a  plan.”  Assume  for  the  moment  that  we  know  that  /  is  bijective,  so 
that  we  know  that  f~l  exists.  To  find  /  1 ,  we  use  what  we  know:  /  1  (y)  =  x  if  and 
only  if  /(x)  =  y.  Thus  we  must  solve  x3  —  5  =  y  for  x.  Once  we  solve  this  equation, 
we  find  thatx  =  (y  +  5)1/3.  Now  we  are  ready  to  solve  this  problem. 

Proof.  We  first  prove  that  /  is  one-to-one.  So  let  xi  and  X2  be  real  numbers.  If 
/(x i)  =  /(x 2),  then  x3  —  5  =  x'2  —  5.  Hence  x3  =  x\.  We  factor  the  difference  to 
get  x3  —  x'2  =  (xi  —  X2)  (xj  +  X1X2  +  x?) .  Thus,  x\  =  X2  or  x\  +  x\X2  +  x\  =  0.  In  the 
latter  case,  we  write  x3  +X1X2  -Tx3  =  (xi  +X2)2  ~xiX2  =  0.  So,  we  would  have 
X1X2  =  (xi  +X2)2  >  0,  and  this  would  mean  that  all  three  terms  in  x2  +X1X2  +X2  are 
nonnegative.  The  only  way  this  sum  can  be  zero  is  if  each  summand  is  zero.  Thus 
xi  =  X2  =  0.  Therefore  xi  =  X2  and  we  may  conclude  that  /  is  one-to-one.  (Alterna¬ 
tively,  you  can  use  the  remark  following  the  discussion  and  proof  of  Theorem  13.2.) 

Now  we  show  that  /  is  onto.  Let  y  £  R.  and  set  x  =  (y  +  5) .  (See  the  remark 
on  the  existence  of  7?th  roots  following  the  discussion  and  proof  of  Theorem  13.2.) 
Then  x  S  K,  and  /(x)  =  ((y  +  5)1/3)3  —5  =  y.  Since  /  :  R  — >  R  is  a  well-defined 
function.  Lemma  15.1  implies  that  /  is  onto. 
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Finally,  we  claim  that  /_1  :  R  — >  R  is  defined  by  =  (y  +  5)1/3.  So  let 

g(y)  =  (y  +  5)1/3.  If  we  can  show  that  g(y)  =  x  if  and  only  if  f(x)  =  y,  then  g 
satisfies  the  definition  of  inverse  function  and  we  can  conclude  that  g  =  f  1 .  So  let 
y  £  R.  Then  x  =  g(y)  =  (y  +  5)  */3  if  and  only  if  x3  =  y  +  5.  The  last  equality  holds 
if  and  only  if  y  =  x3  —  5  =  f(x).  Therefore  x  =  g(y)  if  and  only  if  y  =  f(x)  and 

/_10)  =  g(y)  =  0+ 5) 1/3-  □ 

We’ll  have  another  way  to  show  that  a  function  g  is  the  inverse  of  a  function  / 
as  soon  as  we  prove  Theorem  16.4  below.  That  theorem  will  make  our  life  easier. 
However,  until  we  prove  that  theorem,  we’ll  have  to  resort  to  the  definition  because 
it’s  all  we  have. 

The  example  above  brings  up  an  important  point.  Students  often  confuse  the 
notation  /  1  with  1//.  In  the  example  above  1  //  would  be  the  function  defined 
for  x  ^  51/3  by  l/(x3  —  5),  while  we  have  seen  that  f~l  is  defined  on  all  of  R  by 
/_1(x)  =  (x  +  5)1'  3.  These  two  functions  are  really  quite  different!  In  fact,  /-1  and 
1  /f  are  rarely  the  same.  (See  Project  29.7  for  more  information.) 

You  may  also  be  wondering  whether  the  original  function  will  always  be  defined 
in  terms  of  x  and  the  inverse  function  in  terms  of  v.  The  answer  is:  Of  course  not. 
For  one  thing,  there  is  nothing  “original”  about  the  first  function;  we  might  just 
as  well  have  started  with  the  “inverse”  function  in  the  example  above  (see  Prob¬ 
lem  16.9).  What  we  decide  to  call  the  variable  is  irrelevant:  For  example,  the  func¬ 
tions  g  :  R  — >  R.  and  /; :  R  — >  R  defined  by  g(x)  =  (x  +  5)1/3  and  h(y)  =  (y  +  5)1/3 
are  the  same  function. 

Example  16.2.  Let  / :  R\{3}— »R\{l}be  defined  by  f(x)  =  (x+  l)/(x  —  3).  (You 
should  graph  the  function  /,  and  compare  it  to  our  graph  in  Figure  16.4.)  We’ll  find 
/_1  :R\{1}— >R\{3}. 


Fig.  16.4  /(*)  =  (*+ !)/(*- 3) 
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Before  you  read  our  solution,  use  Examples  14.8  and  15.3  to  check  that  this 
function  is  bijective  and  that  the  domain  and  range  are  also  appropriate  for  /  . 

To  find  an  expression  for  /  ,  let  y  £  R\  {1}.  Then  there  exists  (exactly  one) 

x  e  R\  {3}  such  that  fix)  =  y.  Further,  f(x)  =  y  if  and  only  if  (x+  \)/{x—  3)  =  y, 
and  this  happens  if  and  only  if  x  =  (3y  +  1 )  /  (y  —  1 ) . 

Therefore  /_1(y)  =x=  (3y  +  1 ) / (y  —  1).  O 

In  our  examples  and  exercises  thus  far,  you  probably  noticed  us  repeating  the 
same  steps:  We  first  check  that  /  :  A  — >  B  is  bijective.  If  it  is,  then  we  know 
exists.  To  find  /-1,  we  choose  y  €  B  and  solve  for  the  unique  x  such  that  f(x)  =  y. 
Then,  by  definition,  /_1(y)  =  x,  and  we  are  done. 

Now  you  should  be  ready  to  do  a  more  challenging  example  as  an  exercise. 

Exercise  16.3.  Let  /  :  Z  — >  N  be  defined  by 

...  .  f  2 n  if  n  >  0 

/(")  =  \-2n-lif„<0' 

We  showed  in  Example  15.7  that  this  function  is  bijective.  Find  a  formula  for  /  . 
(You  might  find  it  helpful  to  reexamine  the  graph  of  /  in  Figure  15.3.)  O 

If  A  is  a  set,  one  very  important  function  mapping  A  to  itself  is  the  identity  func¬ 
tion.  So,  the  identity  function  iA  is  the  function  iA  :  A  — >  A  defined  by  i,\  (x)  =  x 
for  all  x  £  A.  You  should  check  that  iA  is  well-defined,  is  both  one-to-one  and  onto, 
and  is  its  very  own  inverse.  In  addition,  this  function  is  easy  to  use.  For  example, 
if  A  and  B  are  sets  and  /  is  a  function  such  that  /  :  A  — >  B,  then  /  o  iA  =  /,  while 

h°f  =  f- 


Theorem  16.4.  Let  f :  A  — >  B  be  a  bijective  function.  Then 

<0  f  0  f- 1  =  h;  that  is,  (/  O  f- 1 )  (y)  =  y  for  all  y  g  B. 

(ii)  f~l  of  =  iA;  that  is,  (/~*  o  f)(x)  =  xfor  all  x  £  A. 

(iii)  /_1  is  a  bijective  function. 

(iv)  If  g  :  B  — >  A  is  a  function  satisfying  fog  =  iB  or  go  f  =  iA,  then  g  =  /  . 

The  last  part  of  this  theorem  says  that  if  we  know  that  our  function  has  an  inverse, 
then  f~l  is  the  one  and  only  function  satisfying  the  identities  in  (iv).  This  can  come 
in  quite  handy.  Consider  the  following. 

Sometimes,  as  in  Exercise  16.3,  it  is  difficult  to  compute  /-1.  In  these  cases  it  is 
nice  to  check  your  answer.  Theorem  16.4  tells  you  one  way  to  do  so:  Suppose  you 
know  that  /  is  bijective,  and  you  are  claiming  that  g  is  the  inverse.  If  you  find  that 
g  o  f  =  iA  or  fog  =  iB,  you  know  you  have  the  right  answer! 

We  also  remark  here  that  (i)  and  (ii)  above  really  follow  from  the  definition  of 
inverse  function:  f{x)  =y  if  and  only  if  (y)  =  x. 
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Proof,  (i)  If  y  £  B,  let  z  =  f  1  (y) .  By  definition  /  1  (y)  =  z  if  and  only  if  f{z)  =  y. 
Therefore 

(/o/_1)0)  =/(/~1W)  =  f(z)  =  y- 

(ii)  If  x  £  A,  let  z  =  f(x) .  By  definition  f(x)  =z  if  and  only  if  /  1  (z)  =  x.  There¬ 
fore, 

(/_1°/)W  =/_1(/W)  =f~\z)  =x. 

(iii)  We  leave  this  for  you  to  do  in  Problem  16.9. 

(iv)  We  note  first  that  dom(g)  =  dom(/-1)  =  B. 

First  suppose  that  go  f  =  ix-  Then,  using  the  associative  property  of  composition 
and  (i)  above,  we  have 

rx  =  iA°rl  =  =8°(f°rl)=8°iB=g- 

In  exactly  the  same  way  (except  we  use  (ii)  in  place  of  (i)),  we  can  show  that  if 
f  og  =  iB,  then  g  =  f~1.  □ 

Before  applying  Theorem  16.4  make  sure  that  you  check  that  /  really  is  bijective. 
It  is  one  of  the  hypotheses,  after  all! 

Exercise  16.5.  For  each  of  the  functions  and  their  inverses  in  Example  16.1,  Ex¬ 
ample  16.2,  and  Exercise  16.3  check  that  /  1  o/=  i^omi/)  anc*  /° /  1  =  *ran(/)- 

o 

The  theorem  above  includes  the  basic  facts  about  inverses.  But  there  are  more 
theorems  that  will  be  useful  as  we  move  along. 

Theorem  16.6.  Let  f  :  A  -  ■  B  and  g  :  B  —>  C  be  bijective  functions.  Then  go  f  is 
bijective  and  (go/)-1  =  /-1og  . 

Before  we  begin,  let’s  make  sure  we  understand  the  function  and  think  about 
what  we  need  to  prove.  We  note  that  g  o  /  :  A  — >  C.  To  show  that  the  composition 
is  bijective,  we  must  show  that  it  is  one-to-one  and  onto.  To  find  the  inverse,  if  we 
have  a  guess  for  what  it  should  be,  we  can  use  Theorem  16.4  (iv),  which  (in  this 
case)  says  that  if  g  o  /  is  bijective  and  h  is  a  function  such  that  ( h  o  (g  o  f))(x)  =  x 
for  all  x  in  A  or  ((go/)  o h)(y)  =  y  for  all  y  in  B,  then  h  must  be  (go/)-1.  So  all 
we  need  to  do  is  think  of  a  good  candidate  for  Ii  (the  object  we  want  to  show  is  the 
inverse)  and  show  it  works.  But  the  statement  of  the  theorem  gives  us  a  candidate 
for  h.  Now  that  we  have  the  plan,  we  can  try  to  carry  it  out. 

Proof.  First  we’ll  show  that  the  composition  go  /  is  one-to-one.  So  let  x\,X2  £  A. 
If  i.g°f)(xi)  =  (g°f)(x2),  then  g(f(x i))  =  g(f(x 2)).  Now  since  g  is  one-to-one, 
f(x  1)  =  fixi).  But  /  is  also  one-to-one,  and  therefore  x\  =  X2,  as  desired. 

To  see  that  the  composition  is  onto,  let  z  £  C.  Since  g  is  onto,  there  exists  ay  £  B 
such  that  g(y)  =  z.  Since  y  £  B  and  /  is  onto,  there  exists  x  £  A  such  that  f{x)  =  y. 
Therefore,  x  £  A  and 
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(g°f){x)  =  g{f(x))  =  g(y )  =  Z. 

Since  go  f  :  A-^>  C,  we  conclude  that  g  o  f  is  onto. 

Now  we  will  show  that  /_1  og-1  is  the  inverse  of  go  f  by  applying  (iv)  of  The¬ 
orem  16.4  to  go  f.  We  just  showed  that  go  f  is  bijective.  Now  we  check  the  hy¬ 
potheses  of  part  (iv)  of  the  theorem.  First,  note  that  the  domain  is  correct;  that  is, 
f~l°g~l  :C  —>A. 

We  now  show  that  ((/_1  og-1)  o  (gof))(z)  =  Z  for  all  z  £  A.  By  (ii)  of  Theorem 
16.4  applied  twice  (as  well  as  the  associative  property  of  composition),  for  every 
z  £  A  we  have 

((rtog~1)°(gof))(z)=f~1(g^t(g{f(z))))=f~1(f(z))=z. 

Using  (iv)  of  Theorem  16.4  we  may  conclude  that  /-1  =  (go  /)_1.  □ 

Remember  that  Polya  suggests  that  after  solving  a  problem,  we  should  look  back 
and  see  whether  we  can  use  the  result  or  the  method  to  solve  a  different  problem. 
Here’s  a  good  chance  to  try  that  out:  Use  the  proof  above  to  establish  the  following. 

Theorem  16.7.  Let  f :  A  — » B  and  g  :  B  — >  C  be  functions. 

(i)  If  f  and  g  are  one-to-one,  then  go  f  is  one-to-one. 

( ii)  If  f  and  g  are  onto,  then  go  f  is  onto. 

The  converses  of  the  two  statements  in  the  theorem  above  are  not  true.  However, 
two  corresponding  weaker  statements  can  be  made.  In  addition,  part  (iii)  of  Theorem 
16.8  provides  a  useful  characterization  of  the  inverse. 

Theorem  16.8.  Let  f :  A  — >  B  and  g  :  B  — >  C  be  functions. 

(i)  If  go  f  is  onto,  then  g  is  onto. 

(ii)  If  g  o  f  is  one-to-one,  then  f  is  one-to-one. 

( iii)  Suppose  now  that  f  :A—>B  and  g  :  B  — >  A.  If  f  o  g  =  iB  and  go  f  =  iA,  then 

g=r1- 

How  does  (iii)  in  Theorem  16.8  differ  from  part  (iv)  in  Theorem  16.4?  Well,  both 
are  implications,  but  the  antecedent  in  one  is  a  disjunction  and  the  antecedent  in  the 
other  is  a  conjunction.  In  addition,  in  Theorem  16.8,  we  do  not  assume  that  f  or  g 
is  bijective.  You  will  need  to  show  that  the  conditions  in  (iii)  imply  that  /  and  g  are, 
in  fact,  bijective.  If  you  already  know  that  one  of  your  functions  /  and  g  is  bijective. 
Theorem  16.4  will  usually  be  easier  to  use  than  Theorem  16.8. 


Exercise  16.9.  Prove  Theorem  16.8. 


o 
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Definitions 

Definition  16.1.  Given  functions  /  :  A  — >  B  and  g  :  C  — >  D  with  ran(/)  C  C, 
we  define  the  composition  of  /  and  g  as  the  function  g  o  f  :  A  — »  Z),  where 
( go  f)(x)  =  g(f(x)).  The  composition  is  also  called  the  composite  function  from 
A  to  D. 

Definition  16.2.  Let  /  :  A  — >  Z?  be  a  bijective  function.  The  inverse  of  /  is  the  func¬ 
tion  f~ 1  :  B  — >  A  defined  by 


/  1  (y)  =  x  if  and  only  if  f(x)  =  y. 


Definition  16.3.  The  identity  function  on  a  set  A  is  the  function  G  :  A  — >  A,  defined 

by  m  (x)  =  x. 


Solutions  to  Exercises 

Solution  (16.3).  This  problem  only  asks  for  a  formula  for  /-1,  which  we  will  give 
here.  You  need  to  think  about  how  we  obtained  this  formula.  You  should  check  that 
/_1  :  N  — >  Z  defined  by 


really  is  the  inverse  of  /. 


Solution  (16.5).  For  the  function  in  Example  16.1  we  calculate 


(r‘°/)W =rV-5)  =  ((x3 — 5) +5)1/3  =  (x3)1/3 


=  x  = 


and 


(/°/  ')(*)  =/(C*  +  5)1/3)  =  ((x  +  5)1/3)  —  5  =  x  +  5  —  5  =  x  =  j'r(x) . 


For  the  function  in  Example  16.2  the  calculations  are 


(r1of)(x)=r 


and 


Finally,  for  the  function  in  Exercise  16.3  the  calculations  are  as  follows. 
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First,  for  /  1  of,  we  use  the  two  cases  that  /  forces  upon  us.  So  suppose  that 
n  £  Z  and  n  >  0.  Then 


(/  lo/)(«)=/  1(2n)  =  (2n)/2  =  n, 


since  2/7  is  even. 

Now  suppose  that  n  <  0.  Then 

(/“ 1  o  /)(«)  =  r  1(-2n  -  1)  =  —  ((— 2n  -  1)  +  l)/2  =  n, 

since  w  =  — 2n  —  1  is  odd.  Therefore  (f~l  o  f)(n)  =n  =  iz(n),  completing  the  first 
part  of  the  problem. 

Now,  for/o /  ,  we  use  the  two  cases  /-l  forces  upon  us.  So  suppose  that  m  £  N 

is  even.  Then 

(/°/-1)('»)  =  f('n/2)  =  2(m/2)  =  m, 
since  m/2>  0.  If  m  is  odd,  then 

(f°f~l)(m)  =/(-('»+  l)/2)  =  — 2(— (m+  l)/2)  —  1  =w, 

since  —  (/«+  l)/2  <  0.  Therefore  (/ o  =  m  =  ^(w),  as  required. 

Solution  (16.9).  (i)  If  c  £  C,  then  the  fact  that  gofis  onto  implies  that  there  exists 
a  £  A  such  that  (go/) (a)  =  c.  Therefore  g(/(a))  =  c.  Since  f[a)  £  B,  we  have 
shown  that  there  is  an  element  b  =  f[a)  in  B  such  that  g(b)  =  c.  Since  g  :  B  — >  C, 
we  conclude  that  g  is  onto. 

(ii)  If  a\  and  ci2  are  in  A  and  f{a i)  =  fiai),  then  g(f(a i))  =  g(f(a2))-  Therefore 
(go  f)(ai)  =  (go  f)(ci2).  Since  gofis  one-to-one,  a\  =  «2  and  /  is  one-to-one,  as 
desired. 

(iii)  Since  go  f  is  one-to-one,  (ii)  implies  that  /  is  one-to-one.  Since  fog  is 

onto,  (i)  implies  that  /  is  onto.  Thus  /  is  bijective.  Consequently,  (iii)  follows  from 
Theorem  16.4  (iv).  □ 


Problems 


Problem  16.1.  Find  the  compositions  fog  and  go  f  assuming  the  domain  of  each 
is  the  largest  set  of  real  numbers  for  which  the  functions  /,  g,  fog,  and  go  f  make 
sense.  In  your  solution  to  each  of  the  following,  give  the  compositions  and  the  cor¬ 
responding  domain  and  range: 

(a)  f(x)  =  1/(1 +x),  g(x)  =x2; 

(b)  f(x)  =  x 2,  g(x)  =  \Jx  (simplify  this  one); 

(c)  f(x)  =  l/x,  g(x)  =x2+l; 

(d)  f(x)  =  |x|,  g(x)  =f(x). 
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Problem  16.2.  Let  /  :  R  — *  R  be  defined  by  /(x)  =  x3  +4.  Use  Theorem  16.8  to 
show  that  if  g  :  R  — >  R  is  defined  by  g(x)  =  (x  —  4)1/3,  then  g  =  f~l. 

Problem  16.3.  Let  /  :  R2  — >■  R  be  defined  by  f(x.y)  =  x  +  y.  Prove  that  there  is  no 
function  g  :  R  — >  R2  such  that  go  f  =  iR2. 

Problem  16.4.  Let  /  :  R  — >  R2  be  defined  by  /(x)  =  (x,0).  Show  that  there  is  no 
function  g  :  R2  — >  R  such  that  fog  =  iS2. 

Problem  16.5.  Functions  f :  A— >B  are  given  below.  For  each  of  them  find  the  range 
of  /.  Further,  if  possible,  find  /  1  :  B  — >  A.  Rigorous  proofs  are  not  required,  but 
you  should  provide  explanations  for  each  of  your  statements. 

(a)  The  function  / :  R  \  {0}  — >  R  is  defined  by  /(x)  =  1  /x. 

(b)  The  function  / :  R2  — >  R  is  defined  by  f(x,y)  =  x  +  y. 

(c)  The  function  / :  R2  —>  R2  is  defined  by  f(x.y)  =  (y,x). 

(d)  The  function  / :  R  — >  R  is  defined  by  /(x)  =  sinx. 

(e)  The  function  /:{x6l:  — 7T/2  <  x  <  7t/2}  — >  R  is  defined  by  /(x)  =  tanx. 

Problem  16.6.  The  functions  /  :  M  \  {—2}  — >  R  \  {1 }  and  g  :  R  \  { 1 }  — >  R  \  {—2} 

defined  by 


/w 


x  —  3 
x  +  2 


and  g(x) 


are  well-defined  (you  need  not  check  this). 


3  +  2x 
1  —x 


(a)  Calculate  fog  and  go  f. 

(b)  What  can  you  conclude  about  /  and  g  from  your  result  in  part  (a)?  If  you  use 
a  theorem,  give  a  reference. 


Problem  16.7.  (a)  If  possible,  find  examples  of  functions  f\A—>B  and  g :  B  — >  A 

such  that  fog  =  is  when: 

(i)  A  =  {1,2,3},  B  =  {4,5}; 

(ii)  A  =  {1,2},  B  =  {4,5}; 

(iii)  A  =  {1,2,3},  B  =  {4,5,6,7}. 

Draw  diagrams  of  A  and  B  in  each  case  above. 

(b)  Give  an  example  of  sets  A  and  B,  and  functions  /  :  A  — >  B  and  g  :  B  — >  A  such 
that  fog  =  iB,  but  g  o  f  /  iA.  (Thus  the  existence  of  a  function  g  such  that 
f°g  =  h  is  not  enough  to  conclude  that  /  has  an  inverse!)  Why  doesn’t  this 
contradict  Theorem  16.4,  part  (iv)? 

(c)  Give  an  example  of  sets  A  and  B,  and  functions  /  :  A  — *•  B  and  g  :  B  — >  A  such 
that  go  f  =  i,\ ,  but  / “ g  /  /«.  (Thus  the  existence  of  a  function  g  such  that 
go  f  =  iA  is  not  enough  to  conclude  that  /  has  an  inverse!)  Why  doesn’t  this 
contradict  Theorem  16.4,  part  (iv)? 

(d)  Let  A  and  B  be  two  sets,  and  let  /  :  A  — >  B  be  a  function.  Assume  further  that 
there  exists  a  function  g  :  B  — >  A  such  that  /  o g  =  iB.  Must  /  be  one-to-one? 
onto? 


16  Inverses 


177 


(e)  Looking  over  your  work  above,  what  should  be  your  strategy  in  solving  a 
question  like  (d)  above?  Whatever  you  decide,  use  it  to  solve  the  following: 
Let  /  and  g  be  as  above  and  suppose  go  f  =  iA .  Must  / be  one-to-one?  onto? 

Problem  16.8.  Consider  the  function  /  :  R2  — >  R2  defined  by  f(x.y)  =  (x,x  +  y). 
Show  that  /  has  an  inverse  and  find  the  inverse  function. 

Problem#  16.9.  Let  /  :  A  — >  B  be  a  bijective  function.  Prove  part  (Hi)  of  Theo¬ 
rem  16.4  and  show  that  (Z-1)^  =  /. 

The  following  theorem  is  quite  often  useful.  It  provides  an  alternate  way  to  prove 
that  a  function  is  bijective. 

Theorem  16.10.  If  f  :  A  — >  B,  then  the  following  are  equivalent. 

1.  The  function  f  is  a  bijection. 

2.  The  function  f  has  an  inverse. 

3.  There  exists  a  function  h  :  B  — >  A  such  that  h  of  =  i a  and  f  oh  =  (g. 

Problem  16.10.  Prove  Theorem  16. 10.  (One  way  to  do  this  is  to  prove  that  1  implies 
2;  2  implies  3;  and  3  implies  1.) 

Problem  16.11.  Prove  that  / :  R+  — >  R  defined  by  f(x)  =  I  no  is  bijective.  (You  may 
use  properties  of  the  logarithm  function  and  exponential  function.) 

Problem  16.12.  (a)  Give  an  example  of  a  function  /  :  A  — >  A  such  that  f  f  i,\ , 

but  f  of  =  i^.  Must  such  a  function  /  be  one-to-one?  onto? 

(b)  Give  an  example  of  a  nonzero  function  / :  M  — >  R  such  that  (/o/)(x)  =  0  for 
all  x  £  R.  Can  such  a  function  be  one-to-one?  onto? 

Problem  16.13.  Let  /  :  A  — >  A  be  a  function.  Suppose  that  fof:A  — ►  A  is  a  bijec¬ 
tion.  Must  such  a  function  /  be  a  bijection?  (Prove  this  or  give  a  counterexample.) 

Problem  16.14.  Suppose  that  /  :  A  —>  B  and  gi  and  g2  are  functions  from  B  to  A 
such  that  fog  i  =  fog2.  Show  that  if  /  is  bijective,  then  gi  =  g2-  If  gi  of  =  g2°f 
and  /  is  bijective,  must  gi  =  g2? 

Problem  16.15.  Let  /  :  A  — >  A  be  a  function.  Define  a  relation  on  ,4  by  a  ~  b  if  and 
only  if  f(a)  =  /'(/;).  Is  this  an  equivalence  relation?  If  /  is  one-to-one,  what  is  the 
equivalence  class  of  a  point  a  G  A? 

Problem  16.16.  Let  /  :  A  — >  A  be  a  function.  Define  a  relation  oil  ,4  by  a  ~  b  if  and 
only  if  f{a)  =  b.  Is  this  an  equivalence  relation  for  an  arbitrary  function  /?  If  not, 
is  there  a  function  for  which  it  is  an  equivalence  relation? 

Problem#  16.17.  Let,4./LC,  and  D  be  nonempty  sets.  Let  /  :  A— >  B  and  g  :  C  ■  D 
be  functions. 
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(a)  Prove  that  if  /  and  g  are  one-to-one,  then  H :  Ax  C  —  B  x  D  defined  by 


H(a,c)  =  (f{a),g(c)) 


is  a  one-to-one  function.  (Check  that  it  is  one-to-one  and  a  function.) 

(b)  Prove  that  if  /  and  g  are  onto,  then  H  is  also  onto. 

Problem*  16.18.  Let  A,B,C,  and  D  be  nonempty  sets.  Let  /  :  A  — >  B  and  g  :  C  —>  I) 
be  functions.  Consider  H  defined  on  A  U  C  by 


Show  that  there  exist  sets  A,B,C,  and  D  for  which  H  is  not  a  function,  but  there 
also  exist  such  sets  for  which  H  is  a  function.  What  conditions  can  we  place  on  A 
and  C  to  ensure  that  H  is  a  function? 

Problem  16.19.  Let  a£l  with  |a|  <  1.  Define  /  on  the  set  {x  G  R  :  |x|  <  1}  by 


(a)  Show  that  the  range  of  /  is  contained  in  the  set  {xeM:M<l}. 

(b)  Does  /  map  onto  the  set  {x  £  R  :  |xj  <  1}? 

(c)  Prove  that  /  is  one-to-one. 

(d)  Compute  fof. 

(e)  Find/-1. 

Problem  16.20.  Let  R[x]  denote  the  set  of  all  polynomials  with  real  coefficients. 

(See  Problem  10.13.) 

(a)  Define  a  function  /  on  R[x]  by  f(p)  =  p( 0).  What  is  the  range  of  /?  Is  / 
one-to-one? 

(b)  Define  a  function  g  on  the  nonzero  polynomials  in  IR[x]  by  g(p)  =  degree  of 
p.  Is  g  a  function?  Is  it  one-to-one?  What  is  the  range  of  g? 

(c)  Recall  that  a  value  z  is  a  root  of  a  polynomial  p  if  p{z)  —  0.  Define  F  on  R[x] 
by  F(p)  =  a  root  of  p.  Is  F  a  function?  Why  or  why  not? 

(d)  Define  h  on  R[x]  by  (h(p))(x)  =xp{x).  Is  h  a  function?  If  so,  is  it  one-to-one? 
What  is  the  range  of  hi 

(e)  Define  k  :  R[x]  — >  R[x]  by  k(p)  =  pop.  Show  that  k  is  neither  one-to-one  nor 
onto. 

Problem  16.21.  For  each  part  give  examples  of  functions  /  :  A  — >  B  and  g  :  B  ■  C 

satisfying  the  stated  conditions. 

(a)  The  composition  gofis  onto,  but  /  is  not  onto. 

(b)  The  composition  g  o  /  is  one-to-one,  but  g  is  not  one-to-one. 

Problem  16.22.  Let  /  :  R  — >  R  be  a  function  such  that  /  is  onto  and  /  o  /  o  /  =  /. 

Prove  that  /  is  bijective. 
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Problem  16.23.  Let  A,B,C ,  and  D  be  nonempty  sets  with  B  C  C  and  D  C  A.  Sup¬ 
pose  that  both  functions  /  :  A  — >  B  and  g  :C  — >  D  are  onto  and  f  o  go  f  =  f.  (Note 
that  the  compositions  go  f  and  / o g  are  both  defined.) 

(a)  Show  that  (fog)  \  g  is  one-to-one. 

(b)  Give  an  example  to  show  that  (go  f)  \D  is  not  necessarily  one-to-one. 


Chapter  17 

Images  and  Inverse  Images 


In  the  last  chapter,  we  looked  at  where  points  in  the  domain  are  mapped  to  under 
a  function  /  and  where  points  in  the  range  come  from  under  /.  But  sometimes  we 
need  to  look  at  where  /  maps  a  whole  set,  or  where  an  entire  set  comes  from.  So 
here  are  two  definitions  that  are  waiting  to  be  understood. 

Let  /  :  X  — >  Y  be  a  function  and  let  A  C  X.  Then  the  image  of  A  under  /  is  the 
set 

f(A)  =  {/(a)  ■  a  GA}. 

Note  that  /(A)  is  the  notation  we  use  for  this  set,  and  that  this  set  is  a  subset  of  Y. 
In  “street  talk”  the  image  of  A  under  /  is  where  the  elements  of  A  were  taken  by  /. 

Exercise  17.1.  It’s  good  to  start  small.  So  let’s  begin  with  the  two  sets  A  =  {1,2,4} 
and  B  =  {  —  1,1,— 2,3}.  Find  each  of  the  requested  images  under  the  function 
/  :  R  — >  R.  defined  by  f(x)  =  x2. 

(a)  What  is /(A)? 

(b)  What  is /(B)? 

(c)  What  is  /(A  OB)? 

(d)  What  is /(A)  n /(B)? 


We’ll  solve  (a)  for  you  here,  so  you  can  see  what  we  are  asking  you  to  do.  We 
claim  that  /(A)  =  {1,4, 16}.  To  see  this,  we  use  the  definition: 

/(A)  =  {/(a)  :  a  £  A}  =  {/(l),/(2),/(4)}  =  {1,4, 16}.  O 

Small  sets  are  easier  because  you  can  often  list  the  values,  just  as  we  did  above. 
This  won’t  be  possible,  in  general,  as  you  will  see  below. 

We  are  also  interested  in  where  sets  in  the  codomain  come  from.  This  is  called  the 
inverse  image  of  a  set  (because  we  are  going  backwards)  and  there  is  one  unfortunate 
thing  about  it:  the  notation  involves  the  symbol  /  ,  which  we  have  used  only  when 

/  is  bijective.  Well,  here  /  may  not  be  bijective,  and  therefore,  /  1  may  not  be 
a  function.  Though  this  may  be  confusing  at  first,  this  is  generally  agreed  upon 

U.  Daepp  and  P.  Gorkin,  Reading,  Writing,  and  Proving:  A  Closer  Look  at  Mathematics ,  181 

Undergraduate  Texts  in  Mathematics,  DOI  10. 1007/978-1 -44 19-9479-0  1 7, 

©  Springer  Science+Business  Media,  LLC  20 1 1 


182 


17  Images  and  Inverse  Images 


notation  and  you  (the  reader)  must  check  carefully  on  the  context.  Having  said  all 
that,  we  now  define  the  inverse  image. 

Let  /  :  X  — >  Y  be  a  function  and  let  B  C  Y.  Then  the  inverse  image  of  B  under  / 
is  the  set 

r\B)  =  {x€X-.f{x)£B}. 

In  other  words,  the  inverse  image  of  B  is  the  subset  ofX  consisting  of  all  the  elements 
in  the  domain  that  get  mapped  into  B.  Note  that  when  /  is  not  bijective,  the  notation 
/  1  (y)  makes  no  sense  (why?).  If  you  want  to  talk  about  the  inverse  image  of  a  set 
with  just  one  element,  say  so  by  writing  /_1({y}).  (You  may  find  texts  in  which 
the  authors  use  the  notation  f~l  (y),  but  we  find  that  it  often  introduces  unnecessary 
confusion.) 

Exercise  17.2.  Let  /  :  R  — >  R  be  defined  by  f(x)  =  x 2.  Find: 

(a)  /_1({4}); 

(b)  /“'({l, 2, 4}); 

(c)  /  '(/(A)),  where  A  =  {1,2}. 

Again,  we  will  do  (a)  here,  so  you  can  see  what  we  are  asking  you  to  do.  By 
definition, 


/  1  ({4})  =  {x  G  R  :  f{x)  G  {4}}  =  {x  G  R  :  f{x)  =  4}. 

Replacing  /  by  what  it  equals,  we  have 

({4})  =  {v  G  K  :  x2  =  4}  =  {—2,2}.  Q 

Since  the  sets  above  are  small,  we  can  list  all  the  elements.  We  ask  that  you 
now  check  your  understanding  with  more  challenging  sets,  but  still  using  the  same 
function  as  in  the  previous  exercises. 

By  carefully  writing  out  the  definitions  of  the  sets  in  Exercise  17.3,  it  is  possible 
to  guess  what  the  answers  are.  We  provide  rigorous  proofs  for  several  parts  at  the 
end  of  this  chapter.  If  you  wish  to  try  them  yourself  first  (which  you  are  certainly 
encouraged  to  do),  make  sure  that  you  work  from  the  inside  out  on  parts  (e)-(h).  So 
in  part  (e),  for  example,  first  find  /([0, 1])  (which  works  just  like  (a))  and  call  that 
set  A.  Then  find  /_1  (A)  (which  works  just  like  (d)). 

Exercise  17.3.  Let  /  :  R  — >  R  be  defined  by  fix)  =  x2.  Find: 

(a)  /([—!,!]); 

(b)  /(Z); 

(c)  r'(N); 

(d)  r'a-^Oj); 

(e)  /-'(/([(M])); 

(f)  f(rl([-h o])); 

(g)  /-'([0, 1]U[2,4]); 

(h)  /([0, 1]  ("I  [—1,0]); 
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(i)  /([o,i])n/([-i,o]). 


O 


Your  experience  with  concrete  sets  will  help  you  work  with  abstract  sets. 

Exercise  17.4.  Looking  back  at  the  examples  in  the  exercises  above,  decide  which 
of  the  following  you  think  are  true  for  all  functions  /  :  X  — »  Y,  all  subsets  A  and  B 
of  X,  and  all  subsets  C  and  D  of  Y: 


(a)  f{f~\C))=C-, 

(b)  /-1(/(A))=A; 

(c)  f(APiB)  =  f  (A)  D/(B); 

(d)  /(A)  =  f{B)  implies  that  A  =  B\ 

(e)  /- 1  (C)  =  f~ 1  (D)  implies  that  C  =  D. 


o 


All  of  the  statements  above  may  look  reasonable,  yet  they  are  all  false.  Nev¬ 
ertheless,  there  are  many  similar  statements  that  are  true.  You  can  prove  them  all 
with  the  tools  you  have  developed  at  this  point.  To  emphasize  the  accepted  writing 
techniques,  we  provide  an  example  below. 

Theorem  17.5.  Let  f :  X  — >  T  and  let  A\  and  A2  be  subsets  ofX.  Then 


/(AinA2)c/(A,)n/(A2). 


“Understanding  the  problem.”  Remember  that  you  won’t  get  anywhere  if  you 
don’t  know  the  definitions.  So  we  need  to  figure  out  what  the  sets  in  the  statement 
are.  We  begin  by  making  sure  we  know  what  f(A\  DA2),  f(A\),  and  /(A2)  are. 
First,  /(A)  =  {  f(x) :  x  £  A}.  So  that  should  make  it  pretty  clear.  Things  in  f(A\ flAi) 
look  like  /(x)  where  x  €  A 1  DA2.  Now  it  should  occur  to  you  that  you  must  write 
out  what  it  means  to  be  in  f(A\)  IT /(A2).  Once  you  have  done  that,  you  have  done 
the  preliminaries. 

“Devising  a  plan."  When  we  worked  with  sets  with  a  special  form  (like  the  Carte¬ 
sian  product  of  two  sets)  we  emphasized  that  if  we  never  used  the  special  form  of 
the  elements,  we  would  most  likely  never  prove  the  desired  result.  The  same  is  true 
here — if  we  never  use  the  fact  that  the  elements  have  the  form  /(x)  where  x  €  A 1 T  A2 
we  shouldn’t  expect  to  be  able  to  prove  the  result.  The  next  step  is  to  note  that  what 
we  want  to  do  is  to  show  that  one  set  is  contained  in  another  set.  We  know  how  to 
do  that,  too.  So  our  plan  is  to  start  with  an  element  in  the  set  on  the  left  side,  use 
the  special  form  of  this  set,  and  show  the  element  is  in  the  set  on  the  right.  As  we 
“carry  out  our  plan,”  note  how  quickly  we  move  to  the  special  form  of  the  element. 

Proof.  If  y  £  /(A]  TA2),  then  y  =  f{x)  for  some  xSA|  TA2.  Since  x  £  Ai  TA2,  we 
have  x£A]  and  x  £  A2.  Since  xSA]  and  y  =  /(x),  we  see  that  y  £  f(A\).  Similarly, 
since  x  £  A2  and y  =  /(x),  we  see  that  y  £  /(A2).  Therefore y  £  f(A\)  T/(A2).  Thus 


/(AinA2)c/(A!)n/(A2). 


□ 
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Exercise  17.6.  We  already  have  an  example  to  show  that,  with  the  notation  from  the 
theorem  above,  we  need  not  have  f(A\  flA2 )  =  f(A\)  n/(A2).  But  what  is  wrong 
with  the  following  “proof”  of  this  “nonfact”? 

Not  a  proof.  It  follows  from  Theorem  17.5  that  f(A\  nA2)  C  f(A\ )  D/(A2).  To 
show  the  reverse  set  inclusion,  we  let  y  £  f(A\)  ft /(At).  By  definition  of  intersec¬ 
tion,  y  £  f(A\)  and  y  £  /(A2).  Therefore,  y  =  f{x)  for  some  x  in  A\  and  y  =  f(x) 
for  some  x  £  A2.  Since  x  £  A\  and  x  £  A2,  we  see  that  x  £  A 1  fl  At.  Thus  y  =  f(x) 
where  x  £  Aj  (TA2,  so  y  £  f(A\  DAt).  This  proves  that  f{A\)  D/(A2)  C  f{A\  (TA2), 
and  the  nonfact  is  established !  H 

We  know  there’s  something  wrong  above  since  the  assertion  isn’t  always  true. 
But  it  isn’t  always  false  either.  Find  the  error  and  see  if  you  can  think  of  another 
hypothesis  we  might  place  on  /  that  would  help  us  to  determine  the  functions  for 
which  the  assertion  is  true.  O 

The  next  theorem  is  one  you  will  use  repeatedly.  You  really  can  do  all  the  proofs 
yourself. 

Theorem  17.7.  Let  f  :  X  — >  Y.  Let  A,A\,  and  At  be  subsets  ofX  and  B,B\,  and  B2 
subsets  ofY.  Then 

1.  if  A\  CA2,  then  f(Ax)  C  f(A2); 

2.  /(A1UA2)=/(A1)U/(A2); 

3.  f(A\  DA2)  Cf(Ai)  fl/(A2); 

4.  in  general,  f{X  \A)  ^  Y  \/(A); 

5.  ifB\  CB2,  then  rUBA  C  f-l(B2); 

6.  f-l(B1nB2)  =  f-l(Bl)nf-l(B2); 

7.  f-l(BlUB2)=f-1(Bl)Uf-l(B2); 

8.  f-l(Y\B)  =  X\f-1(B); 

9.  A  C  /“'(/(A)); 

10.  f(f~l{B))CB. 

We  have  already  presented  a  proof  of  (3)  in  Theorem  17.5,  and  we  will  provide 
a  proof  of  (9)  in  Example  17.8.  The  other  parts  of  the  theorem  are  left  to  the  reader 
(that’s  you)  in  the  problems.  Remember  that  before  beginning  the  proof  of  each  part 
you  must  make  sure  you  know  what  the  left-hand  side  is,  and  what  the  right-hand 
side  is.  We  suggest  that  you  write  out  the  element  definition  of  both  sides  carefully 
(as  we  do  in  Example  17.8  below),  and  then  show  that  appropriate  relations  hold 
using  acceptable  mathematical  and  writing  techniques. 

If  additional  conditions  are  placed  on  the  function  /,  then  some  of  the  conclu¬ 
sions  in  Theorem  17.7  can  be  strengthened.  We  look  at  such  a  case  in  the  following 
example.  Some  of  the  problems  will  ask  you  to  consider  similar  restrictions. 

Example  17.8.  We  will  prove  part  9  of  Theorem  17.7.  Then  we  will  show  that  the 
inclusion  is,  in  general,  proper.  We  conclude  this  example  by  showing  that  if  /  is 
required  to  be  one-to-one,  then  the  two  sets  are  actually  equal. 
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(a)  First  we  prove  that  if  /  :  X  — >  Y  and  ,4  C  X,  then  A  C  /*'  (/(A)).  Before  we 
begin,  we  note  that  the  right  side  is  a  bit  complicated.  Let’s  make  sure  we 
understand  it:  Since  f~l(B)  =  {x  £  X  :  /(x)  £  Bj  replacing  B  by  /(A),  we 
see  that  /_1  (/(A))  =  {x  £  X  :  f(x)  £  /(A)}.  So  we  must  show  that  if  z  £  A, 
then  z  £  {x  £  X  :  f(x)  £  /(A)};  in  other  words,  we  must  show  that  z  £  X  and 

/(z)  £  /(A)- 

Proof  If  z  £  A,  then  since  A  C  X,  we  know  that  z£X.  By  the  definition  of 
/(A),  we  have  f(z)  £  f(A).  Thus,  z  £  f~l{f{A)),  and  A  C  /“'(/(A)).  □ 

(b)  Figure  17.1  indicates  why,  for  an  arbitrary  function  and  an  arbitrary  set  A,  we 
cannot  expect  that  the  two  sets  A  and  f~l  (/(A))  are  equal.  From  the  diagram 
we  see  that  if  we  let  A  =  {1}  and  define  /  :  {1,2}  — >  {a}  by  /(l)  =  /( 2)  =  a, 
then  A  =  {1}  while  /-'(/(A))  =  /_1(/({1}))  =  /-1(M)  =  {1,2}.  Thus 

(c)  However,  if  the  function  /  :  X  — >  Y  is  one-to-one  and  A  C  X,  then  we  can 

conclude  that  A  =  1  (/'(A)). 

Proof.  The  inclusion  A  C  /  '(./(A))  is  proven  in  (a)  above.  For  the  reverse 
inclusion,  suppose  z  £  /_1(/(A)).  Then  z  £  X  and  f(z)  £  /(A).  Thus,  there 
exists  x  £  A  such  that  /(z)  =  /(x).  Now  /  is  one-to-one  and  so  z  =  x.  But 
x  £  A,  so  z  €  A.  Hence  /  '(/'(A))  C  A,  and  we  conclude  that  the  two  sets  are 
equal.  □ 


Definitions 

Definition  17.1.  Let  /  :  X  — >  Y  be  a  function  and  let  A  C  X.  Then  the  image  of  A 
under  /  is  the  set 

f(A)  =  {/(a)  -a£A}. 


Definition  17.2.  Let  / :  X  — >  T  be  a  function  and  let  B  CY.  Then  the  inverse  image 
of  B  under  /  is  the  set 


f-\B)  =  {x£X-.f{x)£B}. 
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Solutions  to  Exercises 


Solution  (17.1).  You  should  be  able  to  check  that: 

(b)  /(B)  =  {1,4,9}; 

(c)  f{ACiB)  =  {1}; 

(d)  /(A)  n/(B)  =  {1,4}. 

Solution  (17.2).  You  should  be  able  to  check  that: 

(b)  ^({1,2,4})  =  {-1,  l,-V2,>/2,-2,2}; 

(c)  /“‘(/(A))  =/-*({  1, 4})  =  {-1,1, -2, 2}. 

Solution  (17.3).  We  give  complete  solutions  to  (a),  (d),  (f),  and  (h)  followed  by 
answers  to  (b),  (c),  (e),  (g),  and  (i). 

(a)  We  claim  that  /([— 1, 1])  =  [0,1].  To  see  this,  let  y  G  /([—  1, 1]).  By  definition 
of  the  image, 

/([-!, !])  =  {/(*)  [-1,1]}  =  {x2  :x€  [-1,1]}. 

So  there  exists  x£  [—1,1]  such  that  y  =  x2.  Since  x  G  [—1,1],  we  know  that 
0  <  x2  <  1.  Therefore,  ye  [0, 1],  and  /([— 1, 1])  C  [0,1]. 

Conversely,  if  y  e  [0, 1],  then  we  let  x  =  yfy.  Thus  x  €  [0, 1]  C  [—1, 1],  and 
/(x)  =  x2  =  {y/y)2  =  y.  Therefore,  there  exists  x  €  [—1,1]  such  that  y  =  /(x) 
and  y  e  /([— 1, 1]).  So  [0,1]  C  /([—  1, 1]),  and  we  conclude  that  the  two  sets 
are  equal. 

(d)  We  claim  that  /-1([— 1,0])  =  {0}.  To  see  this,  let  x  e  /-1  ([— 1,0]).  By  defi¬ 
nition, 

r1  ([-1,0])  =  {x  e  R  :  fix)  e  [-1,0]}  =  {x  e  K  :  X2  e  [-1,0]}. 

Thus  x  e  /~*([— 1,0])  implies  that  x2  e  [—1,0].  This  is  only  possible  if 
x  =  0.  Therefore  /-1([— 1,0])  C  {0}.  Now  suppose  that  x  G  {0}.  Then 
/(x)  =  /( 0)  =  0.  Therefore,  /(x)  G  [—1,0],  and  x  G  /_1([— 1,0]).  Conse¬ 
quently,  {0}  C  /_1([— 1,0]),  and  we  conclude  that  /^!([— 1,0])  =  {0}. 

(f)  We  claim  that  /(/-1([— 1,0]))  =  {0}.  By  (d),  we  know  /^1([— 1,0])  =  {0}. 
Therefore,  we  need  to  find 

/cr1a-i,o]))=/({o}). 

Thus 

/(r1a-i,o]))=/({o})  =  {/(o)}  =  {o}, 

as  desired. 

(h)  We  work  from  the  inside  out.  Thus, 


/([o,  i]  n  [-1,0])  =  /({ o})  =  {/( o)}  =  {0}. 
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The  answer  to  (b)  is  { z 2  :  z  £  N};  to  (c)  is  {ypn  :  n  £  N}  U  {— ypn  :  n  £  N};  to  (e)  is 
[—1,1];  to  (g)  is  [—1,1]  U  [— 2,  —  y/2]  U  [\/2,2];  and  to  (i)  is  [0, 1], 

There  are  many  possible  ways  to  solve  these  problems.  Though  each  problem 
seems  to  be  different,  what  remains  the  same  in  every  problem  is  that  you  need  to 
understand  the  notation  and  the  definitions. 

Solution  (17.4).  None  of  the  five  statements  holds  for  all  functions  and  sets.  Thus 
we  will  give  a  counterexample  for  each  case.  In  all  cases  we  will  use  the  function 
defined  by  f(x)  =  x2. 

(a)  Exercise  17.3  (f)  provides  a  counterexample. 

(b)  Exercise  17.3  (e)  provides  a  counterexample. 

(c)  Exercise  17.3  (h)  and  (i)  provide  a  counterexample. 

(d)  Using  /  as  defined  above,  /({—  1, 1})  =  {1}  =  /({l})  but  {  —  1,1}  p  {1}. 

(e)  Again  using  the  function  defined  above,  /-1({— 1,0})  =  {0}  =  /-1({0})  but 
{-1,0}  ^{0}. 

Solution  (17.6).  In  “not  a  proof”  we  correctly  establish  that  “y  =  f(x)  for  some 

x  £  A i  and  y  =  f(x)  for  some  x  £  A i.”  From  this  statement  we  incorrectly  conclude 
that  “x  £  A 1  and  x  £  A2.”  We  may  only  conclude  that  there  exists  an  xi  £  A  \  such 
that  y  =  f(x  1 ) ,  and  there  exists  an  X2  £  A2  such  that  y  =  f{x 2) .  We  may  not  conclude 
that  xi  =  X2-  Indeed,  Exercise  17.1  shows  that  there  are  cases  where  neither  of  the 
two  elements  is  in  the  intersection. 

However,  the  following  is  true.  Let  /  :  X  — >  Y  be  an  injective  function,  and  let 
A\  and  A2  be  subsets  of  X.  Then  f{A\  [TA2)  =  f{A\)  IT/(A2).  Why?  Well,  we  know 
that  /(x  1)  =  y  =  f{x 2)  and  we  now  know  that  /  is  injective.  Thus,  xi  =  X2,  so 

xi  £  A 1  DA2.  The  rest  of  the  “not  a  proof”  is  valid. 


Spotlight:  Minimum  or  Infimum? 

If  you’ve  ever  forgotten  to  show  that  the  infimum  of  a  set  was  the  minimum,  this 
historical  example  should  show  you  that  it  can  happen  to  the  best  of  us. 

Suppose  you  know  the  temperature  on  the  surface  of  the  earth  (because  people 
on  the  surface  measured  it)  and  you  want  the  temperature  on  the  inside  (but  no  one 
can  get  to  the  place  to  measure  it).  How  do  you  get  the  temperature?  This  kind  of 
question  interested  several  famous  mathematicians.  In  fact,  there’s  plenty  of  mathe¬ 
matical  research  done  today  that  is  related  to  this  problem. 

This  problem  is  known  as  the  Dirichlet  problem.  It  can  be  studied  in  more  gen¬ 
erality,  but  we’ll  stick  to  looking  at  functions  defined  on  a  sphere.  To  understand 
the  statement,  you  need  to  have  studied  several  variable  calculus.  At  the  end  of  this 
spotlight,  we  will  state  the  problem  on  the  sphere,  along  with  the  references,  for 
those  of  you  who  have  the  background. 

The  solution  to  this  problem  used  something  called  the  “Dirichlet  principle.”  The 
idea  of  the  principle  was  to  look  at  a  collection  of  certain  integrals  with  the  region 
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of  integration  fixed,  but  with  different  integrands.  It  was  first  argued  that  the  val¬ 
ues  of  the  integrals  were  bounded  below  (and  therefore,  as  we  have  learned,  there 
was  an  infimum).  From  there,  the  mathematicians  assumed  that  for  one  of  the  func¬ 
tions,  the  integral  was  the  minimum.  This  principle  was  used  by  many  excellent 
mathematicians:  by  George  Green,  Georg  Friedrich  Bernhard  Riemann,  Sir  William 
Thomson  (also  known  as  Lord  Kelvin),  and  others.  In  1870,  Karl  Theodor  Wil¬ 
helm  Weierstrass  presented  an  important  paper  about  the  validity  of  this  argument. 
Even  the  title  of  his  article  “Uber  das  sogenannte  Dirichlet’sche  Princip”  (On  the 
so-called  Dirichlet  principle)  is  enough  to  show  what  Weierstrass  thought  of  the 
principle  [107].  He  began  his  paper  by  reconstructing  Dirichlet’s  argument.  He  then 
explained  that  though  an  expression  may  have  a  lower  bound  that  we  can  get  ar¬ 
bitrarily  close  to,  we  may  never  actually  reach  it.  Weierstrass  concluded  his  paper 
with  an  example  showing  how  this  might  happen.  Almost  thirty  years  later,  David 
Hilbert  supplied  a  proof  of  the  principle  for  certain  cases  when  he  presented  what 
he  called  the  “resuscitation”  of  the  Dirichlet  principle  [86,  p.  67]. 

It  may  seem  odd  that  mathematicians  of  this  calibre  would  use  an  unproven  prin¬ 
ciple.  There  are  two  things  to  remember.  First,  the  principle  was  supported  on  phys¬ 
ical  grounds.  Second,  rigor  was  still  being  introduced  to  mathematics.  In  spite  of 
its  unusual  history,  the  Dirichlet  principle  served  an  important  purpose.  In  Kline’s 
words  [59,  p.  704]  “Had  the  progress  made  with  the  use  of  the  principle  awaited 
Hilbert’s  work,  a  large  segment  of  nineteenth-century  work  on  potential  theory  and 
function  theory  would  have  been  lost.” 

More  information  on  this  is  available  in  [35],  [59],  [32],  and  [71].  Another  crit¬ 
icism  of  the  Dirichlet  principle,  from  a  different  point  of  view,  was  published  by 
Friedrich  Prym.  More  information  about  this  can  be  found  in  [87].  The  statement  of 
the  problem  in  R3  is  the  following:  Let  /  be  a  continuous  real-valued  function  on  the 
sphere  of  radius  one  (the  unit  sphere).  A  real-valued  function  g  is  called  harmonic 
on  the  open  unit  ball  ({(x,y,z)  €  R3  :  x2  +  y2  +z2  <  1})  if  g  has  continuous  second 
partial  derivatives  satisfying 


gxx  +  gyy  +  gzz  =  0 

throughout  the  ball.  The  question  is:  Does  there  exist  a  function  F  that  is  continuous 
on  the  closed  ball  of  radius  one,  equal  to  /  on  the  unit  sphere,  and  harmonic  on  the 
open  unit  ball? 


Problems 

Problem  17.1.  Recall  that  [a,7>]  denotes  the  closed  interval  from  a  to  b,  while  (a.b) 
denotes  the  open  interval.  For  the  function  / :  R  — >  R  defined  by  /(x)  =  3x  —  1,  find: 


(a)  /((0, 1)); 

(b)  where  a,b  £  R  and  a  <  b; 

(c)  /-'((— 2 ,  — 1)); 
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(d)  /  1((a,b)),  where  a,b  £  R  and  a  <  b. 

Problem  17.2.  For  the  function  /  :  M  — ►  JR  defined  by  f(x)  =  2x2,  find: 

(a)  /(( 0,1)); 

(b)  /((— 1,3)); 

(c)  and  (in  general)  f((a,b)),  where  a,b  £  ffi.  and  a  <b\ 

(d)  /_1((-2,-l)); 

(e)  /_1((0,2)); 

(f)  and  (in  general)  /  l((a,b)),  where  a,b  £  JR.  and  a  <  b. 

Actually,  we  are  really  only  interested  in  your  answers  to  (c)  and  (f).  So  why  did 
you  have  to  work  all  the  other  parts? 

Problem  17.3.  For  the  function  /  :  M  — >  K  defined  by  f(x)  =  \x\,  find: 

(a)  /((—l,  1)); 

(b)  /({— 1,1}); 

(c)  /-'({I}); 

(d)  /-'([— 1,0)); 

(e)  /_1(/([0, 1])). 

Problem  17.4.  Let  /  :  K  — >  K  be  defined  by  f(x)  =  9  —  x2.  Find  the  two  sets 
/((-3,l])and/-1((-l,4)). 

Problem  17.5.  Consider  the  function  %(o,i)  :  K  — >■  K.  (this  is  the  characteristic  of 
Definition  14.4).  Find: 

(a)  £(o,i) ((0, 1)); 

(b)  £(o, !)_((—!, 3)); 

(c)  and  (in  general)  £(o.n((a,^)),  where  a,b  £  K  and  a  <b\  prove  that  the  set 
you  found  is  correct; 

(d)  ^((-2,-1)); 

(e)  £(-^((0,2)); 

(f)  and  (in  general)  X(p\){{aib))*  where  a,b  £  K  and  a  <  b;  prove  that  the  set 
you  found  is  correct, 

Actually,  we  are  really  only  interested  in  your  answers  to  (c)  and  (f).  So  why  did 
you  have  to  work  all  the  other  parts? 

Problem  17.6.  We  denote  the  characteristic  function  of  Z  in  R  by  Xz  ■  R  — >  M  (see 
Definition  14.4).  In  each  case  below,  start  by  writing  out  the  definition  for  the  par¬ 
ticular  set  and  function.  Then  write  the  solution  to  each  of  the  following  in  as  simple 
a  form  as  possible: 

(a)  £z(Z+); 

(b)  Zz'(^+)’ 

(c)  Zz(Zz1(Z+)); 

(d)  xzr1(*z(z+)). 
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Problem  17.7.  Let  / :  R  — >  K  be  defined  by  f(x)  =  x4  +  1. 

(a)  Make  a  careful  graph  of  /. 

(b)  Using  your  graph,  show  how  you  can  guess  /([0,2]). 

(c)  Prove  that  your  guess  for  /([0, 2])  is  correct. 

(d)  Use  your  graph  to  find  /“*  (j 2, 17]). 

(e)  Prove  that  your  guess  for  /_1([2, 17])  is  correct. 

Problem  17.8.  Let  p  and  q  be  two  polynomials  of  degree  two  with  real  coefficients. 
(See  Problem  10.13  for  definitions.)  Suppose  p_1({0})  =  q^1  ({0}). 

(a)  Give  an  example  of  such  p  and  q,  with  p  /  q. 

(b)  Suppose  that  p_1({0})  =  {0, 1}  =  ^  1  ({0}).  Must  p  =  ql  Either  prove  this 
or  give  a  counterexample. 

Problem  17.9.  Let  / :  Z  — >  N  be  defined  by 


Find  /( 2Z)  and  prove  that  your  answer  is  correct. 

Problem  17.10.  Prove  Theorem  17.7  part  1. 

Problem  17.11.  Using  Theorem  17.7  part  1,  rather  than  element-chasing,  prove 

Theorem  17.7  part  3. 

Problem  17.12.  Prove  Theorem  17.7  part  2. 

Problem  17.13.  (a)  Establish  part  4  of  Theorem  17.7.  We  suggest  the  following 

strategy:  Try  to  prove  that  the  two  sets  are  equal.  If  you  do  this  carefully,  you 
may  start  to  wish  for  restrictions  on  /  that  you  don’t  have.  This  should  help 
you  think  of  examples  to  show  that  the  two  sets  need  not  be  equal. 

(b)  If  /  is  onto,  does  the  statement  in  Theorem  17.7  part  4  become  an  equality? 
What  if  /  is  one-to-one? 

(c)  Show  that  if  /  is  bijective,  then  equality  holds. 

Problem  17.14.  (a)  Prove  Theorem  17.7  part  5. 

(b)  In  the  same  context,  what  can  you  conclude  if  B\  lh  -  State  your  result  and 

prove  it. 

Problem  17.15.  Prove  Theorem  17.7  part  6. 

Problem  17.16.  Prove  Theorem  17.7  part  7. 

Problem  17.17.  Prove  Theorem  17.7  part  8. 

Problem  17.18.  (a)  Prove  Theorem  17.7  part  10. 

(b)  Give  an  example  to  show  that  the  two  sets  may  not  be  equal. 

(c)  If  /  is  onto,  must  the  two  sets  be  equal? 
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(d)  If  /  is  one-to-one,  must  the  two  sets  be  equal? 

Problem  17.19.  Let  X  and  Y  be  nonempty  sets  and  /  :X-tfa  function. 

(a)  Prove  or  give  a  counterexample  to  the  statement:  If  A  and  B  are  subsets  of  X, 
then  f(A\B)=f(A)\f(B). 

(b)  Find  necessary  and  sufficient  conditions  on  the  function  /  such  that  for  all 
subsets  A  and  B  of  X,  we  have  f(A\B )  =  f(A)\f(B). 

Problem  17.20.  Let  /  :  X  — >  Y  be  a  function  satisfying  /(A)  n  f(B )  =  0  whenever 
A  and  B  are  sets  with  A  D  B  =  0. 

(a)  Give  an  example  of  such  a  function.  Prove  that  your  example  satisfies  the 
condition  above. 

(b)  Prove  that  such  a  function  must  be  one-to-one. 

Problem  17.21.  Let  f :  A— >B  be  a  function.  Prove  that  if  /  is  onto,  then  the  collec¬ 
tion  {  /  1  ({£>})  :  b  £  B}  partitions  the  set  A. 

Problem  17.22.  Suppose  that  /  :  X  — >  Y  is  a  function,  and  let  Ai  and  A 2  be  subsets 
ofX. 

(a)  If/(A1)=/(A2),mustA1=A2? 

(b)  Let  /  be  a  bijective  function.  Show  that  if  f(A\)  =  /(A2),  then  A 1  =  A2. 
Indicate  clearly  where  you  use  one-to-one  or  onto.  Did  you  use  both? 

Problem  17.23.  Suppose  that  /  :  X  — >  Y  is  a  function,  and  let  B\  and  /1 2  be  subsets 
of  Y. 

(a)  If  =  /_1(Z?2),  must  B\  =  fi2? 

(b)  Let  /  be  a  bijective  function.  Show  that  if  /-1  (Z?i)  =  /_1  (Z?2),  then  B\  =  Z?2. 
Indicate  clearly  where  you  use  one-to-one  or  onto.  Did  you  use  both? 

Problem  17.24.  Let  X  be  a  nonempty  set  and  let  A]  and  A2  be  subsets  of  X.  Recall 
the  characteristic  function  of  A  in  X  of  Definition  14.4. 

(a)  If  XAi  =  Xa2,  must  Ai  =  A2? 

(b)  We  define  the  product  Xax  •  Xa2  pointwise  by  (XAi  ■  Xa2)(x)  =  Xax  (x)  •  Xa2  (*) 
for  all  x  eX.  Prove  that  Xa1  '  Xa2  =  XAtnA2- 

(c)  Show  that  Xa,  +  Xa2  ~  XAtnA2  =  Xa (In  other  words,  for  each  xGX,we 
have  Xax  (x)  +  Xa2  (x)  ~  XAtnA2  {x)  =  Xax ua2  (x) .) 

(d)  Can  you  find  a  similar  result  for  %x  ,i,  ? 

Problem  17.25.  (For  students  with  a  background  in  calculus.)  For  real  numbers  c.  d 
with  c  <  d  we  denote  the  open  interval  in  K  by  (c.d)  =  {x  eR:c<x<rf|.  Recall 
that  a  function  /  :  R  — >  K  is  strictly  increasing  if  for  all  x,y  e  dom(/)  whenever 
x  <  y,  then  f[x)  <  f(y). 

(a)  Show  that  there  are  continuous  functions  /  :  R  — >  K.  and  real  numbers  a .  b 
with  a  <  b  such  that  f((a,b))  ^  (f(a),f(b)). 

(b)  Prove  that  a  continuous  function  /  :  R  — >  K  is  strictly  increasing  if  and  only 
if  f((a,b))  =  ( f(a),f(b ))  for  all  a,b  e  R  with  a  <b. 


Chapter  18 

Mathematical  Induction 


Suppose  that  you  want  to  show  something  is  true  for  all  positive  integers.  You  could 
start  by  checking  that  the  statement  is  true  for  n  =  1,  n  =  2,  and  so  on,  but  you  would 
have  to  stop  somewhere.  Even  if  you  check  lots  and  lots  of  integers,  you  can  run  into 
problems.  Consider  the  following: 

Let  us  suppose  that  you  are  asked  to  prove  that  n2+n  +  41  is  prime  for  every  pos¬ 
itive  integer  n.  You  might  think  the  following  is  good  enough  to  convince  someone: 
if  /(h)  =  n2  +  n  +  41,  then  /(l)  =  43  (which  is  prime),  /( 2)  =  47  (which  is  prime), 
/( 3)  =  53  (prime  too),  and  so  on.  In  fact,  checking  the  first  39  integers  reveals  that 
f{ii)  is  indeed  prime  for  n=  1, ... ,39.  Is  this  enough  evidence  to  prove  that  it  is 
true  for  all  positive  integers  nl  Check  n  =  40:  /(40)  =  1681,  which  is  divisible  by 
41.  What’s  the  moral  of  this  story?  That  examples,  even  many,  many  examples,  are 
not  a  method  of  proof.  It  can  help  us  find  counterexamples  or  it  can  motivate  us 
to  formulate  a  conjecture,  but  unless  we  can  check  every  single  case,  it  will  never 
prove  anything. 

One  mathematical  technique  to  prove  that  a  statement  holds  for  all  positive  inte¬ 
gers  is  to  show  that  the  statement  is  true  for  n  =  1  and  that  whenever  it  is  true  for  a 
positive  integer  n,  it  is  true  for  the  next  positive  integer  n+  1.  Then,  since  you  have 
shown  it  is  true  for  n  =  1,  it  must  be  true  for  n  =  2  (because  it’s  always  true  for 
a  successor).  Now  that  the  statement  is  true  for  n  =  2,  it  has  to  be  true  for  n  =  3, 
because  3  is  the  integer  after  2,  and  so  on.  This  is  called  mathematical  induction, 
and  a  more  precise  description  of  this  method  of  proof  is  given  below. 

This  method  is  sometimes  compared  to  lining  up  dominoes  and  making  them  fall 
down  (see  H.  Steinhaus  [102]).  What  has  to  happen?  The  first  one  has  to  fall,  and 
every  time  one  falls  the  one  after  it  must  fall.  Once  this  happens,  all  the  dominoes 
do  fall  down. 

Theorem  18.1  (Principle  of  mathematical  induction).  For  an  integer  n,  let  Pin) 
denote  an  assertion.  Suppose  that 

(i)  (The  base  step)  P(l)  is  true,  and 

(ii)  ( The  induction  step)  for  all  positive  integers  n,  if  P{n)  is  true,  then  P{n  +  1) 
is  true. 
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Then  P{n)  holds  for  all  positive  integers  n. 

The  principle  of  mathematical  induction  is  a  direct  consequence  of  the  well¬ 
ordering  principle  of  N  we  came  across  in  Chapter  12.  The  proof  of  Theorem  18.1 
will  be  by  contradiction:  were  the  induction  principle  false,  then  we  could  con¬ 
struct  a  nonempty  subset  of  the  natural  numbers  that  would  not  have  a  minimum — a 
contradiction  to  the  well-ordering  principle.  This  is  the  main  idea  in  the  proof  that 
follows. 

Proof.  Suppose  the  induction  principle  were  false.  Then  there  would  exist  an  as¬ 
sertion  P  that  would  satisfy  conditions  (i)  and  (ii)  of  the  theorem,  but  Pin)  would 
be  false  for  some  n  €  Z+.  So  let  A  =  {k  €  Z+  :  P(k)  is  false}.  Our  supposition  im¬ 
plies  that  A  is  nonempty.  By  the  well-ordering  principle  [p.  125],  the  set  A  has  a 
minimum.  Let  m  denote  this  minimum.  By  condition  (i),  m  f  1.  Since  m  €  Z+,  it 
follows  that  m  >  2.  Consider  the  integer  n  =  m  —  1  >  1.  Since  n  <  m  and  m  is  the 
minimum  of  A,  we  know  that  n^A.  Thus  P(n)  is  true.  By  condition  (ii),  P(n  +  1 )  is 
true  too.  But  P(n  +  1)  =  P(m),  so  P(m)  must  also  be  true,  a  contradiction.  □ 

Students  often  mistakenly  believe  condition  (ii)  says  that  P(n)  is  true,  and  ask 
why  we  would  state  it  again  as  a  conclusion.  Look  carefully  at  condition  (ii).  Note 
that  it  is  an  implication.  We  are  not  saying  that  P{n)  is  true.  We  are  saying  that  if 
P(n)  is  true,  then  Pin  +  1)  is  tme.  The  antecedent  in  this  implication  is  called  the 
induction  hypothesis. 

The  next  example  is  one  that  is  associated  with  Carl  Friedrich  Gauss.  As  one 
version  of  the  story  goes,  when  Gauss  was  10  years  old,  his  teacher,  Herr  Biittner, 
asked  the  students  to  sum  the  integers  from  1  to  100.  Gauss  did  it  almost  instantly. 
It  is  believed  that  he  did  it  by  the  following  method. 

Write  the  sum  horizontally  forwards  and  backwards  as: 

1  +  2  +  3  +  •  •  •  +  99  +  100 


100  +  99  +  98  +  •••  +  2  +  1. 

Now  add  vertically.  When  you  do  this,  you  will  get  101  one  hundred  times;  in 
other  words,  you  get  ( 101 ) ( 100) .  This  is  twice  the  sum  that  you  needed,  so  the 
answer  must  be  ( 101 )  (100)  /2.  There  is  nothing  special  about  the  integer  100.  If 

you  try  this  with  a  general  positive  integer  n,  you  will  see  that  1  +2  +  3-1 - Yn 

=  n(n  + 1)/2  for  every  positive  integer  n.  What  a  nice  formula!  You  will  give  it 
a  rigorous  proof  using  mathematical  induction  when  you  work  Problem  18.1.  Is 
something  like  this  formula  true  for  the  sums  of  squares  of  the  first  n  integers? 
Indeed  it  is. 


Example  18.2.  Using  mathematical  induction,  show  that 


l2  +  22  H - 


n(n  +  l)(2n  +  1) 

6 


for  every  positive  integer  n. 
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Proof.  Let  P(n)  be  the  assertion  that 


12  +  22H - b  n2 


n(n  +  l)(2n+  1) 

6 


First  we  check  the  base  step:  P(l)  is  the  statement  that  1  =  (1(1  +  1)(2+  l))/6, 
and  this  is  certainly  true. 

Now  we  verify  the  induction  step.  Let  n  £  Z+  and  suppose  Pin)  holds.  Thus  we 
suppose  that  for  an  n  £  Z+  we  have 


l2  +  22  - 


9  n(n  +  l)(2n+l) 
-  n  =  — - — 

6 


(18.1) 


We  wish  to  show  that  P(n  +  1)  holds;  that  is,  that 

2  ~2  t  in'* 1  (w+l)((n+l)  +  l)(2(u  + 1)  + 1) 

l2  +  22  H - b(n+lr  =  2 - — - ’  - - - 

6 

We  start  by  grouping  the  left  side  of  P(n  +  1)  and  then  simplify  as  follows: 

l2  +  22  H - b«2  +  (n+  l)2 

=  (l2  +  22  H —  •  +  n2)  +  (n  +  l)2 


n(n  +  l)(2n+  1) 


+  (n+  l)2  (by  our  induction  hypothesis  (18.1)) 


—  (n+  1) 
=  (n+  1) 


n(2n  +  1) 


+  («+!)  )  (factor  out  n  +  1 ) 


2n2  +  7n  +  6^ 
6  ) 


=  (n+i)(,,+2)(2"+3) 

6 

(n  +  l)((n  +  1)  +  l)(2(n  +  1)  +  1) 


By  mathematical  induction  we  conclude  that  the  assertion  holds  for  all  positive  in¬ 
tegers.  □ 


Induction  proofs  must  contain  certain  steps.  Look  at  the  proof  above  and  see  if 
you  can  find  each  of  the  steps  described  below. 

(1)  You  should  indicate  clearly  what  you  are  trying  to  prove.  (2)  There  is  always 
the  base  step,  in  which  we  check  the  first  assertion.  (This  need  not  always  begin 
with  n  =  1;  it  can  begin  with  n  =  3,  n  =  0,  or  even  at  a  negative  integer!  In  fact,  as 
long  as  what  you  say  is  true,  it  can  begin  at  any  integer  you  want  it  to  begin  at.)  (3) 
Then  we  have  the  induction  step,  in  which  we  show  that  for  each  n  £  Z  that  is  at 
least  as  big  as  the  integer  used  in  the  base  step,  if  P(n)  is  true,  then  P{n  +  1)  is  true. 

Of  course,  you  still  need  to  write  using  complete  sentences,  and  you  still  need  to 
introduce  every  variable  to  the  reader  when  the  reader  meets  it  (not  after  the  reader 
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has  met  it  for  the  first  time!).  Finally,  do  tell  the  reader  what  the  base  step  is  (“First 
we  show  the  assertion  holds  for  n  =  1”),  what  the  induction  step  is  (“We  suppose 
that  P(n)  holds  for  an  n  £  Z+\  that  is  . . .  holds”),  and  what  you  will  prove  (“We  will 
show  that  P{n  +1)  holds;  that  is  . . .  holds”).  This  is  as  much  for  your  benefit  as  it  is 
for  the  reader’s.  This  step  shows  you  where  you  will  begin  and  where  you  will  have 
to  end.  Then  show  what  you  said  you  will  show  and  indicate  clearly  where  you  use 
the  induction  hypothesis.  End  your  proof  with  a  concluding  sentence. 

Many  statements  proved  by  induction  involve  sums  or  products.  We  remind  you 
of  the  standard  notation  for  this.  In  the  following,  k  £  Z  and  £  R.  The  notation 
for  sum  is 

n 

a\  +  fl2  T  T  ' ' '  T  an  =  ^  ak  j 

k=  1 


and  the  notation  for  product  is 


n 

a\  ■  a2 '  <23 . an  =  I  1 1 /  /  . 

k—  1 


This  notation  often  saves  space  and  makes  a  statement  look  neater.  For  instance, 
the  result  we  proved  in  Example  18.2  is 


n 

For  n  £  Z+,  we  have  ^  k2 

k—1 


n(n  +  1)(2  n  +  1) 
6 


If  you  are  ever  unsure  about  what  such  a  statement  says,  you  will  almost  certainly 
find  it  helpful  to  rewrite  the  expression  the  long  way. 

Exercise  18.3.  Let  x\  ,X2,  ■  ■  ■  ,x„  be  real  numbers.  Prove  that  for  n  £  Z+,  both  of  the 
following  hold: 

(a)  irGu**i=nLi  Wand 

(b)  |E2=i**l<E2=iW-  O 


The  following  exercise  illustrates  how  induction  can  go  awry.  It’s  cute,  but  not 
very  mathematical.  A  similar  example,  but  a  more  mathematical  one,  appears  in  the 
problems.  See  if  you  can  spot  the  error  in  that  one. 

Exercise  18.4.  All  people  at  Bucknell  University  have  the  same  color  hair. 

Not  a  proof.  Let  P(n)  be  the  assertion  that  every  group  of  n  people  has  the  same 
color  hair  (as  each  other).  Then  P(l)  is  the  statement  that  one  person  has  the  same 
color  hair  as  herself.  This  is  certainly  true.  So  let  n  £  Z+  and  suppose  that  P(n)  is 
true;  that  is,  when  we  have  n  people,  they  all  have  the  same  color  hair.  We  need 
to  show  that  this  implies  that  n  +  1  people  in  a  group  have  the  same  color  hair.  So 
consider  a  group  of  n  +  1  people.  If  we  look  at  the  first  n  of  them  (people  1  through 
n  in  the  group),  by  the  induction  hypothesis  they  all  have  the  same  color  hair,  which 
we  may  as  well  assume  is  black  for  right  now.  So  the  first  n  people  all  have  black 
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hair.  Now  consider  the  last  n  people  in  this  group  (people  2  through  n  +  1  in  the 
group).  Again,  by  our  induction  hypothesis,  they  all  have  the  same  color  hair.  Those 
who  are  in  both  groups  are  also  in  the  first  group,  and  therefore  have  black  hair.  (See 
Figure  18.1.)  Thus,  since  all  people  in  the  second  group  have  the  same  color  hair, 
everyone  has  black  hair.  By  mathematical  induction  we  conclude  that  all  people  at 
Bucknell  have  the  same  color  hair.  What  a  boring  campus.  H 

There  must  be  an  error!  Exactly  where  is  it?  O 

Exercise  18.5.  Use  induction  to  prove  that  for  all  natural  numbers  n,  the  expression 
4”  —  1  is  a  multiple  of  3. 

“Understanding  the  problem.”  Well,  once  again,  it’s  probably  a  good  idea  to 
make  sure  that  we  know  what  everything  means  here.  We  need  to  show  that  4"  —  1 
is  a  multiple  of  3  for  every  natural  number  n.  That  means  we  need  to  show  that  there 
exists  an  integer  k  such  that  4"  —  1  =  3k. 

“Devising  a  plan."  The  outline  is  presented  to  you  below  and  the  complete  solu¬ 
tion  appears  at  the  end  of  the  chapter. 

1.  Say  clearly  what  the  assertion  P(n)  is.  (Most  mathematicians  write  this  out  with¬ 
out  labeling  the  assertion  with  P(n)  explicitly.) 

2.  Check  the  base  step  (n  =  0). 

3.  Write  out  the  induction  step  in  the  principle  of  mathematical  induction  clearly. 
Make  sure  you  replace  P(n)  by  what  it  says,  and  replace  P(n+  1)  by  what  it  says. 
This  will  help  you  figure  out  what  you  are  supposing  (you  are  supposing  P(n)) 
and  what  you  need  to  end  with  (you  need  to  end  with  P(n  +  1)). 

4.  Write  out  the  induction  hypothesis;  that  is,  write  out  what  you  are  assuming  to 
be  true. 

5.  Having  done  all  of  the  above,  look  at  4n+  —  1  and  show  that  it  is  divisible  by  3. 
Indicate  clearly  where  you  use  the  induction  hypothesis. 

6.  State  your  conclusion  clearly.  O 


198 


1 8  Mathematical  Induction 


Induction  can  also  be  used  to  define  functions.  Perhaps  the  best  known  example 
of  this  technique  is  the  definition  of  factorial:  For  n  G  N,  we  define  n  factorial, 
written  as  n\,  as  follows: 


0!  =  1 

(n  +  1)!  =  (n+1) -n\  for«>0. 

What  have  we  done?  We  defined  a  function  g  :  N  — >  N,  denoted  g(n)  =  n\,  by  telling 
you  that  g(0)  =  1,  g(l)  =  1  -g(O)  =  1,  g(2)  =  2-g(l)  =  2,  g(3)  =  3  ■  g(2)  =  6,  etc. 

You  might  be  thinking,  “They  say  they’ve  defined  a  function.  Is  it  really  well- 
defined?”  If  this  is  what  you  are,  in  fact,  thinking,  that’s  great.  To  prove  that  g  is  a 
function,  we’ll  actually  need  a  theorem.  This  theorem  is  often  called  the  recursion 
theorem. 

Theorem  18.6  (Recursion  theorem).  Let  X  be  a  nonempty  set,  f  :  X  — >X  a  func¬ 
tion,  and  a  G  X.  Then  there  is  a  unique  function  g  :  N  — +  X  such  that  g(0)  =  a  and 
g(n  +  1)  =  f(g(n))  for  all  n  G  N. 

We  usually  begin  by  understanding  the  problem.  In  this  case,  understanding  the 
theorem  might  also  require  some  effort.  What  does  it  say?  To  define  g,  we  say  how  to 
get  started  (that’s  what  we  are  doing  when  we  tell  you  g( 0)  =  a).  But  that’s  only  how 
you  get  started.  To  proceed  from  your  starting  point,  we  tell  you  how  to  compute 
g(n  +1).  You  have  a  rule  (that’s  /)  and  the  previous  values  of  g  (that’s  where  g{n) 
comes  into  play)  and  you  compose  them  (that’s  g(n  +  1)  =  f(g(n))).  And  what  we 
are  saying  is  that  this  g  that  you  get  is  a  well-defined  function. 

“Understanding  the  problem"  There  are  two  parts  to  this  proof.  We  will  need  to 
show  that  a  function  g  exists  and  that  at  most  one  function  g  exists.  If  we  can  figure 
out  how  to  show  the  function  exists,  we  will  try  to  show  that  uniqueness  follows 
the  way  it  often  does.  So,  how  can  you  show  that  something  you  don’t  have  exists? 
Any  function  we  define  has  to  be  a  relation  satisfying  the  conditions  stated.  But,  on 
top  of  the  conditions,  we  also  want  to  obtain  uniqueness.  We  will  fall  back  on  the 
definition  of  function  as  a  relation  and  we’ll  try  to  find  the  smallest  relation  that  does 
what  we  need.  Being  the  smallest  should  make  it  unique,  if  we  are  lucky.  Whatever 
object  we  end  up  constructing  will  also  need  to  satisfy  conditions  (i)  and  (ii)  of  the 
definition  of  function.  It  might,  at  this  point,  be  advisable  to  review  our  original 
function  definition.  Definition  14.1. 

“Devising  a  plan.”  For  the  existence,  we’ve  decided  that  the  smallest  relation  has 
a  chance  at  satisfying  the  conditions.  So  we  want  a  “small  set”  that  “does  certain 
things.”  Getting  a  small  set  suggests  intersecting  things  that  do  what  we  want.  So 
we  will  look  at  all  relations  from  N  to  Y;  that  is,  all  subsets  A  of  N  x  X,  with  the 
property  that  (0,a)  G  A  and  whenever  (n,x)  G  A,  then  (n  +  l,/(x))  G  A.  If  we  can 
show  that  there  is  at  least  one  such  relation,  then  we’ll  take  the  intersection  of  all  of 
them  and  show  that  this  is  our  function  g. 

To  show  uniqueness  we  will  suppose  that  there  is  a  second  function  h  satisfying 
our  needs  and  we  will  try  to  contradict  something,  most  likely  the  minimal  nature 
of  g.  Let  us  see  whether  we  can  turn  this  plan  into  a  successful  proof. 
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As  you  read  this  proof,  you’ll  notice  that  the  set  ^  that  we  define  comes  up  a  lot. 
So  you  should  write  down,  in  some  very  handy  place,  what  you  have  to  do  to  get 
into  the  set  . 

Proof.  We  will  first  show  that  there  exists  a  function  g  :  N  — »  X  satisfying  the  stated 
conditions.  We  let  &  be  the  set  of  all  subsets  A  of  N  x  X  with  the  condition  that 
(0 ,a)  £  A  and  whenever  (; n,x )  €  A,  then  (n+  l,/(x))  £  A.  Since  Nxlsf’wesee 
that  'if  ^  0.  Consequently,  we  can  form  the  intersection  of  all  elements  of  9f,  which 
we  call  g.  So  g  =  (~)Ae^A.  Obviously  gCNxX.  Since  (0,a)  £  A  for  all  A  £  ^  we 
also  have  (0,a)  £  g.  If  (n,x)  £  g,  then  (n.x)  £  A  for  all  A  £  ^ .  This  implies  that 
(n  +  1  ,/(x))  £  A  for  all  A  £  Hence  (n  +  1  ,/(x))  £  g.  This  shows  that  g  £  'tf  and 
g  is  a  relation  from  N  to  X.  We  claim  that,  in  fact,  g  :  N  — >  X  is  a  function.  Since  the 
domain  and  codomain  are  specified,  we  need  only  show  that  the  two  conditions  of 
the  function  definition  hold. 

For  condition  (i)  of  the  function  definition  we  need  to  show  that  for  each  n  £  N 
there  exists  x  £  X  such  that  (n,x)  £  g.  We  will  prove  this  by  induction  on  n.  First  the 
base  step:  Because  g  £  ^,  we  know  that  (0 ,a)  £  g  where  a  £  X. 

For  the  induction  step,  let  n  £  N  and  suppose  that  (n.x)  £  g.  Since  x  £  X,  we 
know  that  /(x)  £  X.  Thus,  since  g  £  “if  we  conclude  that  (n  +  l,/(x))  £  g.  So  for 
n  +  1,  the  element  /(x)  is  the  element  of  X  that  we  were  looking  for;  that  is,  by 
induction,  we  know  that  for  each  n  £  N  there  exists  x  £  X  such  that  (n.x)  £  g  and 
we  conclude  that  condition  (i)  of  the  function  definition  holds. 

For  condition  (ii),  we  need  to  show  that  for  each  n  £  N  if  (n.x)  and  (n.y)  are 
elements  of  g,  then  x  =  y.  We  will  show  this,  again,  by  induction  on  n.  For  the  base 
step,  we  have  (0,a)  £  g.  Suppose  that  (0,y)  £  g  withy  ^  a.  Define  h\  =  g\  {(0,y)}. 
Then  h\  C  g.  Clearly,  (0 ,a)  £  h\  since  we  haven’t  removed  it  from  g.  If  for  some 
n  £  N  we  have  (n,x)  £  h\,  then  the  fact  that  h\  C  g  implies  that  (n,x)  £  g.  Therefore, 
(n+  l,/(x))  £  g.  Since  (n+  1  ,/(x))  (0,y)  we  conclude  that  (n+  1  ,/(x))  £  h\. 

Thus  h\  £  ^  and  h\  C  g.  This  contradicts  the  construction  of  g  and  ensures  that 
y  =  a.  Thus  we  have  handled  the  base  case  of  the  induction. 

We  now  show  the  induction  step.  So,  let  n  £  N  and  suppose  that  whenever  (n.x) 
and  (n,  y)  are  elements  of  g,  then  x  =  y.  We  must  show  that  whenever  (n  +  1 .  u)  and 
(n  +  1 ,  v)  are  elelments  of  g,  then  u  =  v. 

By  condition  (i)  above  there  exists  z  £  X  with  (n.z)  £  g  and,  since  g  £  iC,  we 
know  that  ( n  +  1  ,f(z))  £  g-  Suppose  that  there  exists  w  £  X  with  f(z)  w  and 
(n  +  1 ,  w)  £  g.  As  before,  we  define  h2  =  g  \  { (n  +  1 ,  w) }  and  we  claim  that  I12  £  ^ . 
To  this  end,  note  that  (0,  a)  (n  +  1 ,  w)  and  (0 ,a)  £  g.  Hence  (0 ,a)  £  I12,  so  the  first 
condition  for  admittance  to  ^  is  satisfied.  Furthermore,  if  m  £  N  and  (m.x)  £  I12, 
we  claim  (m  +  l,/(x))  £  li2- 

To  this  end,  note  that  ( m,x )  £  I12  implies  that  (m.x)  £  g.  Because  g  £  'if  we  have 
(m  +  1  ,/(x))  £  g.  If  m  n,  then  (m+  l,/(x))  ^  («  +  1,  w)  and  (m+  l,/(x))  £  fe.  If 
m  =  n,  then  we  have  (m.x)  =  (n,x)  £  g  and  we  chose  z  so  that  (n,z)  £  g-  Thus,  by  our 
induction  hypothesis,  x  =  z.  Since  /(x)  =  f(z)  w,  we  conclude  that  (. m+  l,/(x)) 

(n  + 1 ,  w) .  This  shows  that  ( m  +  1 ,  /(x) )  £  and  completes  the  proof  that  I12  £  ^ . 
So,  we  have  constructed  an  element,  I12,  of  'X'  that  is  strictly  contained  in  g.  This 
contradicts  the  minimality  of  g  and  shows  that  w  =  f(z). 
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By  induction,  condition  (ii)  of  the  function  definition  also  holds.  Thus  the  inter¬ 
section  g  is  a  well-defined  function  g  :  N  — >  X. 

To  show  the  uniqueness  of  g  :  N  — satisfying  g(0)  =  a  and  g(n+  1)  =  f(g(n)) 
for  all  n  £  N,  we  suppose  to  the  contrary  that  there  is  a  different  function  k  :  N  — >  X 
that  also  satisfies  k( 0)  =  a  and  kin  +  1)  =  f{k{n))  for  all  n  £  N.  We  define  the 
set  S  =  {x  £  N  :  g(x)  ^  fc(x)}.  Now  we  suppose  that  k  and  g  are  different  func¬ 
tions,  so  S  ^  0.  By  the  well-ordering  principle  of  N  this  set  has  a  minimum,  which 
we  call  m  =  minS.  So  g(m)  ^  k(m).  Since  g(0)  =  a  =  k( 0)  we  know  that  m  >  0. 
Therefore  m  —  1  £  N  and  in  -  1  (f_  S.  Hence  g(m  —  1)  =  k(m  —  1).  This  implies  that 
g(m)  =  f(g{m  —  1))  =  f(k(m  —  1))  =  k(m).  This  is  a  contradiction  and  shows  that 
our  supposition  is  wrong.  We  conclude  that  g  =  k  and  our  function  is  unique.  □ 

We  now  have  a  way  of  defining  a  function  on  the  natural  numbers:  1)  We  give 
the  value  of  the  function  at  the  initial  point  (this  may  be  at  n  =  0,«  =  1,  or  it  may 
be  any  other  natural  number)  and  2)  we  give  a  rule  about  how  to  find  the  value  of 
a  natural  number  in  terms  of  the  function  value  of  the  previous  number.  A  function 
defined  in  this  manner  is  said  to  be  defined  recursively  or  defined  by  induction. 

Exercise  18.7.  We  define  g  :  N  — >  N  recursively  by 

g(  0)  =  1  andg(n+ 1)  =  5g(n). 


(a)  Find  g(l)  and  g(8). 

(b)  For  this  example  describe  the  value  a,  the  set  X,  and  the  function  /  in  the 

recursion  theorem.  Why  is  the  recursion  theorem  needed  here?  O 

We  return  to  our  motivating  example,  the  factorial  function.  It’s  a  function  you’ve 
known  for  years,  most  likely,  and  yet  it  is  more  complicated  than  it  appears — more 
complicated  than  most  of  the  other  examples  in  this  section.  For  example,  the  re¬ 
cursion  theorem  uses  g(n  +  1)  =  f{g(n)).  But  we  need  g(n  +  1)  =  (n  +  l)g(n)  and 
it’s  difficult  to  imagine  how  to  express  this  as  g(f(n)).  It  feels  more  like  a  rule 
combining  two  functions,  (n+  1)  and  gin).  We  exploit  this  observation  below. 

Example  18.8.  We  defined  the  factorial  function  above.  Explain  how  the  recursion 
theorem  can  be  used  to  show  that  this  definition  describes  a  unique  function. 

For  this  example,  let  a  =  (0, 1),  X  =  N  x  N,  and  let  / :  N  x  N  — >  N  x  N  be  defined 
by  f(x,y )  =  (x+  1,  (jr+  l)y).  The  recursion  theorem  tells  us  that  there  is  a  unique 
function  g:N-»NxN  with  g(0)  =  (0, 1)  and  g(n  +  1)  =  f(g(n))}  Well,  we  aren’t 
interested  in  the  first  coordinate;  only  the  second  one  (why?).  So  we  compose  g 
with  the  well-defined  function  i:NxN->N  where  n  (x,y)  =  y.  If  each  function  is 
well-defined  and  unique,  then  their  composition  n  o  g  :  N  — >  N  is  too.  Before  going 

1  At  this  point,  you  probably  know  that  you  should  write  out  g(0),g(l),g(2),  etc.,  until  you  under¬ 
stand  what’s  happening. 
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on,  check  that  the  first  coordinate  of  g(n)  is  n,  and  note  that  the  second  coordinate 
of  g(n)  is  n (g(n)).  Now  we  get 


(sog)(0)  =  7r(0, 1)  =  1  and 

(7Tog)(n+l)  =n(f(g(n)))  =  n{n  +  \,{n+\)n(g(n)))  =  (n+  l)(jlog)(n). 


If  we  write  (nog)(n)  =n\,  then  we  have  the  familiar  definition. 


o 


Definitions 

Definition  18.1.  For  n  £  N,  we  define  n  factorial,  written  as  n\,  as  follows: 


0!  =  1 

(n  +  1) !  =  (n  +  1)  •  n\  for  n>  0. 


Definition  18.2  (for  Problem  18.22).  A  subset  S  of  R2  is  convex  if  for  every  two 
points  x,y  £  .S',  the  line  segment  joining  x  and  y  again  lies  in  S. 

Definition  18.3  (for  Problem  18.24).  A  triangular  number,  Tn,  is  the  number  of 
equally  spaced  points  that  can  be  used  to  form  an  equilateral  triangle  with  sides  built 
of  n  equally  spaced  points  (see  Figure  18.2). 

Definition  18.4  (for  Problem  18.25).  For  k,n  £  N  with  k  <  n  we  define  the  bino¬ 
mial  coefficient  as 


Solutions  to  Exercises 


Solution  (18.3). 


Proof.  [Proof  of  (a)]  The  base  step  n  =  1  is  trivial. 


Tt  =  1  T2  =  3  T3  =  6 


t4  =  10 


Fig.  18.2  Triangular  numbers 
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For  the  induction  step,  let  n  G  Z+  and  suppose  that  [+_]  xk  I  =  ri!t=i  \xk\-  Then 


72+1 

/  n  \ 

n** 

= 

(  n+  )  xn+ 1 

k=  1 

\*=1  / 

n* 

k=  1 


|xn+i|  (by  Theorem  5.3) 


'  n  \ 

"T  +  |jr„+i|  (by  induction  hypothesis) 

\k=\  J 


n+ 1 

=  nw. 

k=\ 


The  result  follows  from  the  principle  of  mathematical  induction. 


Proof.  [Proof  of  (b)]  We  will  use  the  triangle  inequality  (Theorem  5.8),  which  has 
been  proven  (by  you)  in  Problem  5.14.  Our  proof  will  be  by  induction  on  n.  For 
n  G  Z+,  let  P(n)  denote  the  assertion  that  |  Y!k=ixk  I  <  ELt  W\- 
The  validity  of  the  base  step,  n  =  1,  is  clear. 

Now  let  n  be  a  positive  integer  and  suppose  that  P(n)  holds;  that  is,  we  let  n  G  Z+ 
and  suppose  that  Ylk=  \  xk  <  Y!k=i  IxaI-  We  must  show  that  P(n+  1)  holds;  in  other 
words,  we  must  show  that  xj.  <  \xk\-  But 


n+ 1 

Tjxk 

k=  I 


—  |(xl  H - \~xn)  +xn+ 1 1 

<  |vi  H - hx„  |  +  \xn+\  |  (by  the  triangle  inequality) 


k=  I 


|xn+l  | 


<  V  \xk  \  +  \x„+i  |  (by  the  induction  hypothesis) 

k=  1 
n+1 

E  w> 


A=1 


and  the  result  now  follows  from  the  principle  of  mathematical  induction.  □ 

Solution  (18.4).  If  the  base  step  is  for  n  =  1,  then  the  induction  step,  Pin)  implies 
P(n  +  1),  needs  to  be  valid  for  all  n>  1.  We  made  the  following  argument:  “Those 
who  are  in  both  groups  are  also  in  the  first  group  and  therefore  they  have  black 
hair.”  This  argument  is  not  valid  if  n  =  1 .  In  that  case,  the  group  of  the  first  n  people 
is  disjoint  from  the  group  of  the  last  n  people.  However,  our  argument  requires  that 
some  person  be  in  both  groups.  Hence  the  reasoning  falls  apart  right  where  it  should: 
If  a  second  person  joins  a  black-haired  person,  there  is  no  guarantee  that  he  or  she 
will  also  have  black  hair. 
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Solution  (18.5). 

Proof.  For  n  £  N,  let  P(n)  denote  the  assertion  that  4”  —  1  is  a  multiple  of  3;  that  is, 
there  is  k  £  Z  such  that  4"  —  1  =  3k.  We  will  prove  this  by  induction  on  n. 

We  check  the  base  step.  For  n  =  0  the  statement  becomes  4°  —  1  =  0  is  divisible 
by  3.  This  is  obviously  true. 

Now  we  check  the  induction  step.  Let  n  £N  and  suppose  that  there  exists  k  £  Z 
such  that  4"  —  1  =  3k.  We  need  to  show  that  there  exists  Z  £  Z  such  that  4n+1  —  1  =  3/. 
Consider  the  following  calculation: 

4"+1  -  1  =  4-4"  -  1  =  3  -  4"  +  (4"  -  1)  =  3  -  4"  +  3  k=  3(4  n+k), 

where  the  second-to-last  equality  is  justified  by  the  induction  hypothesis.  Now  set 
l  =  4"  +  k.  Then  l  £  Z  and  4” 1 1  —  1  =31.  Hence  the  induction  step  is  established. 
By  the  principle  of  mathematical  induction,  4"  —  1  is  divisible  by  3  for  all  n  £  N. 

□ 


Solution  (18.7). 

(a)  g(l)  =  5g(0)  =5-1=5. 

g(  8)  =  5g(7)  =  52g(6)  =  53g(5)  =  54g(4)  =  5sg(3)  =  56g(2)  =  57g(l)  =  58. 

(b)  In  the  theorem,  we  set  a  =  1,  X  =  N,  and  / :  N  — >  N  defined  by  f{x)  =  5x.  We 
can  now  verify  that  g(0)  =  1  and  g(n  +  1)  =  f(g(n))  =  5 g(n).  The  theorem 
tells  us  that  the  function  g  is  well-defined  and  unique.  Without  this  theorem, 
we  wouldn’t  be  able  to  conclude  that  this  is  true! 


Problems 

Problem  18.1.  Prove  that  1+2-1 - Yn  =  n(n  +  l)/2  for  every  positive  integer  n, 

using  the  principle  of  mathematical  induction. 

Problem  18.2.  Prove  that  1  +  3  +  5  H - 1-  (2 n  —  1 )  -  ir  for  every  positive  integer 

n,  using  the  principle  of  mathematical  induction. 

Problem  18.3.  Prove  that  13  +  23H - h  /;3  =  (1+2-1 - b  n)1  for  every  positive 

integer  n,  using  the  principle  of  mathematical  induction. 

Problem  18.4.  Prove  that  if  n  £  Z+  and  r  is  a  real  number  such  that  r  /  1 ,  then 


Problem  18.5.  Show  that  2"  <  n !  for  all  integers  with  n  >  5. 

Problem  18.6.  Use  induction  to  prove  Bernoulli’s  inequality:  Forx  £  R,  if  1  +x  >  0, 
then  (1  +x)n  >  1  +nx  for  all  n  £  N. 


E 
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Problem  18.7.  Show  that  for  every  positive  integer  n. 


111  1  ,  n 

—  +  —  +  —  +  •••  +  —  R  1  +  —  • 
1  2  3  2n  -  2 


(This  can  be  used  to  show  that  the  harmonic  series  I  +  j  +  jH - - diverges.) 

Problem  18.8.  Show  that  2"  >  rr  for  all  integers  n  with  n  >  5. 

Problem  18.9.  Prove  that  8  divides  52"  —  1  for  all  n  £  N. 


Problem  18.10.  Suppose  that  g  :  N  — >  N  satisfies  g(n  +  1)  =  g(n)  +  g(l)  for  all 
n  £  N. 

(a)  Find  g(0). 

(b)  Show  that  g(n  +  m)  =  g(n)  +g(m)  for  all  n,m  £  N. 

Problem  18.11.  Let  g  :  N  — >  R  1  and  let  a  be  a  positive  real  number.  Suppose  that  g 
has  the  properties  that  g(l)  =  a  and  g{m  +  n)  =  g(m)g(n)  for  all  natural  numbers  n 
and  m. 

(a)  Find  g(0).  Justify  your  answer. 

(b)  Define  g  recursively. 

(c)  Prove  that  g(n)  =  a"  for  all  n  £  N. 

Problem  18.12.  Let  ai,a2 ,...,an  be  real  numbers  that  satisfy  \ai\  <  1  for  all 
j—  1,2, ...  ,n.  Prove  that  for  all  n  £  Z+  the  following  holds: 


n  aij~i 


<  E  \ai~l\- 

j= i 


Problem  18.13.  Show  that  for  all  positive  integers  n, 

2 {yjn  +  1  —  1 )  <  1  H - —  +  ■  •  •  H - -p.  <  2 \fn. 

V2  y/n 


Problem  18.14.  Show  that  for  all  integers  n  >  2, 


n 


n 


1  \  2_ 
y/k)  <  »2' 


Problem  18.15.  Find  the  error  in  the  Not  a  proof  below.  (See  Problem  10. 13  for  the 
definition  of  the  degree  of  a  polynomial.) 

Nontheorem.  Let  p  be  a  polynomial  of  positive  degree  n  such  that  p  is  a  product  of 
degree-one  polynomials  and  p{ 0)  =  0.  If  c  £  R  satisfies  p{c)  =  0,  then  c  =  0. 

In  other  words,  our  claim  is  that  if  p(x )  =  ax{a\x  +  b\ )  •  •  •  (a„-ix  +  bn~\),  where 
a, a i, . . .  ,anp,b\ , . . .  ,b„~ i  £  R  and  a, a i , . . .  ,a„_i  ^  0,  then  the  only  root  of  p  is  0. 
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Not  a  proof.  We  will  prove  this  statement  by  induction  on  the  degree  n  of  the  poly¬ 
nomial  p. 

For  the  base  step,  we  let  n  =  1.  Since  /;(())  =  0,  we  can  write  p(x)  =  ax  for  some 
a  £  JR.  and  0.  If  p[c)  =  0,  then  p(c)  =  ac  =  0.  Since  a  0,  we  conclude  that 
c  =  0. 

For  the  induction  step,  let  n  £  Z  1  and  suppose  that  if  p  is  a  polynomial  of  degree 
n  that  is  a  product  of  degree-one  polynomials  and  satisfies  p(0)  =  0,  then  pic)  =  0 
implies  that  c  =  0.  Let  p  be  a  polynomial  of  degree  n  +  1  that  factors  into  n  + 
1  degree-one  polynomials  and  satisfies  p( 0)  =  0.  We  need  to  show  that  pic)  =  0 
implies  that  c  =  0.  Write  p(x)  =  ax(a\x  +  bi)  ■  ■  ■  (anx  +  bn),  where  a,a\,...,an  are 
nonzero  real  numbers  and  bi,...,bn  G  EL  Suppose  that  p(c)  =  0.  Then 

0  =  p(c)  =  ac(a\c  +  bi)  ■  ■  ■  (a„c  +  b„). 

One  of  the  factors,  ac,a\c-\-b\, . . .  ,anc  +  bn,  must  vanish.  Rearranging  terms  if  nec¬ 
essary,  we  may  assume  that  the  factor  ac  or  the  factor  a\c  +  b\  vanishes.  Now, 

q{x)  =  ax{a\x  +  b\)  ■  ■  ■  (a„_ix  +  b„_i) 

is  a  polynomial  of  degree  n  that  is  a  product  of  degree-one  polynomials  and  satisfies 
q( 0)  =  0.  Since  ac{a\c  +  b\ )  =  0,  we  have  q(c)  =  0.  Since  our  induction  hypothesis 
applies  to  q,  we  conclude  that  c  =  0.  Therefore,  p{c)  =  0  implies  that  c  =  0,  and  the 
nontheorem  follows  from  mathematical  induction.  0 

Problem  18.16.  We  define  the  function  g  :  N  — >  K  1  recursively  by  g(0)  =  1  and 

g(«+l)  =  g(g|,2;f5  for  n  G  N. 

(a)  Calculate  g(3). 

(b)  Find  the  value  of  a,  the  set  X,  and  the  function  /  used  in  the  recursion  theorem 
to  justify  the  recursive  definition  of  this  particular  function  g. 

Problem  18.17.  Let  X  =  {(x,y)  G  R2  :  x  >  0,y  >  0,  and  x  +  y  <  1}.  We  define  the 
function  g  :  N  — >•  £?(X)  recursively  as  follows: 
g(0)  =  X  and  for  n  G  N, 

g(n+  1)  = 

{(x/2,y/2)  :  (x,y)  G  g(n)}  U  {((x+  l)/2,y/2)  :  (x,y)  £  g(n)j 
U{(-r/2.(y+1)/2)  :  (x,y)€g(n)j. 

Sketch  g(0),g(l),g(2),  and  tg(3).  What  kind  of  object  is  this  function  building? 

Problem  18.18.  Let  /  :NxN— >NxNbe  defined  by  f(x,y )  =  (x-l-  l,y2/x)  and 
choose  (1,5)  G  N  x  N.  According  to  the  recursion  theorem,  there  is  a  unique  func¬ 
tion  ^:N-»NxN  such  that  g(0)  =  (1,5)  and  gin  +  1)  =  f(g(n ))  for  n  £  N.  Let 
7T  :  N  x  N  — >  N  be  defined  by  7r(x, y)  =  y  and  let  /z  =  n  o  g  :  N  — >  N. 

(a)  Calculate  h  (2). 
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(b)  Define  h  recursively  by  giving  li( 0)  and  h(n  +  1)  in  terms  of  h(ti)  and  n. 

Problem  18.19.  Write  recursive  functions  for  the  following,  identifying  the  point 
a ,  the  set  X,  and  the  functions  /  and  g  in  the  recursion  theorem.  Prove  that  your 
answers  are  correct. 

(a)  Given  a  real  number  r  >  0,  write  a  recursive  function  that  computes  rn  to 
every  n  £  N; 

(b)  Write  a  recursive  function  that  computes  the  sum  of  the  first  n  positive  inte¬ 
gers. 

There  is  an  equivalent  form  of  the  principle  of  mathematical  induction,  namely: 

Theorem  18.9  (Second  principle  of  mathematical  induction).  For  an  integer  n, 
let  Q{n)  denote  an  assertion.  Suppose  that 

(i)  2(1)  is  true  and 

(ii)  for  all  positive  integers  n,  if  Q{  1),  ■  ■  ■  ,Q(n)  are  true,  then  Q{n  +1)  is  true. 
Then  Q{n)  holds  for  all  positive  integers  n. 

Problem  18.20.  Prove  that  the  first  principle  of  mathematical  induction  (Theo¬ 
rem  18. 1)  implies  the  second  one  (Theorem  18.9).  To  do  so,  let  P(n)  be  the  assertion 
“Q(  1),...,<2(«)  are  true.” 

Problem  18.21.  Prove  that  every  integer  n,  where  n  >  2,  is  a  prime  or  the  product 
of  prime  numbers.  (We  have  used  this  before;  this  shows  that  every  integer  n  >2 
can  be  factored  as  a  product  of  primes.  If  you  also  prove  the  uniqueness  of  this 
factorization,  you  will  have  proved  the  fundamental  theorem  of  arithmetic.) 

Problem  18.22.  A  subset  S  of  R2  is  convex  if  for  every  two  points  x.y  €  S,  the  line 
segment  joining  x  and  y  again  lies  in  S.  Recall  that  an  interior  angle  at  a  vertex  of  a 
convex  polygon  is  the  smaller  of  the  two  angles  formed  by  the  edges  at  that  vertex. 

Prove  that  for  an  integer  n,  where  n  >  3,  the  sum  of  all  the  interior  angles  of  a 
convex  polygon  with  n  vertices  is  (n  —  2)180  degrees.  (See  Figure  18.3.) 


Fig.  18.3  The  sum  of  all  the  interior  angles  in  this  convex  polygon  is  4  ■  180' 
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Problem  18.23.  Let  p„  be  a  polynomial  with  real  coefficients  and  of  positive  degree 
n.  (See  Problem  10.13  for  the  definitions.) 

(a)  Suppose  p„(x)  =  a„xn  +an~\xn~l  H - t-aix+ao-  For  a  real  number  a,  what 

is  the  largest  the  degree  of  qn,  defined  by  qn{x)  =  p„{x)  —  (x  —  a)anxn^  ‘ ,  can 
be? 

(b)  Let  a  £  ffi.  and  n  £  Z+.  Prove  that  pn{a )  =  0  if  and  only  if  (x  —  a)  is  a  factor 
of  pn{x). 

Problem  18.24.  A  triangular  number,  Tn,  is  the  number  of  equally  spaced  points 
that  can  be  used  to  form  an  equilateral  triangle  with  sides  built  of  n  equally  spaced 
points  (see  Figure  18.2  on  page  201). 

(a)  Find  a  formula  for  the  n,h  triangular  number,  and  prove  that  your  formula  is 
correct. 

(b)  Can  you  think  of  a  (familiar)  game  that  uses  7)?  7)  ? 

For  k,  n  £  N  with  k  <  n  we  define  the  binomial  coefficient  as 

/  n\  nl 

k\{n  —  k)\ 

Because  the  binomial  coefficient  is  the  number  of  ways  that  we  can  choose  k  differ¬ 
ent  elements  from  a  set  of  n  elements,  we  read  as  “n  choose  k.” 

Theorem  18.10  (Binomial  theorem).  Let  a,b  £  R\  {0}  and  n  £  Z+.  Then 

{a  +  br=±fyjV-k. 

Problem  18.25.  This  problem  refers  to  the  notation  and  theorem  above. 

(a)  Compute  each  of  the  following: 


(b)  Consider  the  special  case  of  Theorem  18. lOin  which  (m+  l)2  =  m2  +  2m  +  1, 
where  m  £  N.  A  “picture  proof”  is  presented  in  Figure  18.4.  Explain  the  pic¬ 
ture  proof. 

(c)  Prove  that  for  all  k,n  £  N  with  1  <  k  <  n,  we  get 


(If  you  write  out  what  it  means,  life  will  be  a  lot  easier.) 

(d)  Use  part  (c)  to  prove  Theorem  18. 10.  (See  Project  29.8,  on  Pascal’s  triangle.) 
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m  dots 


m+1 

dots 


Fig.  18.4  "Proof”  of  (m  +  !)■  =  m2  +  2m  +  1 


(e)  Prove  that 

^  f  (—  1)^  =  0  for  all  n  €  Z+. 
k= o  W 

Problem  18.26.  Deduce  the  well-ordering  principle  of  N,  stated  in  Chapter  12,  from 
Theorem  18.9  stated  on  page  206. 

Recall  that  we  used  the  well-ordering  principle  of  N  to  prove  Theorem  18.1  and 
that  Theorem  18.1  implies  Theorem  18.9  (as  shown  in  Problem  18.20).  So  this 
problem  shows  that  the  principle  of  mathematical  induction,  the  second  principle 
of  mathematical  induction,  and  the  well-ordering  principle  of  N  are  all  equivalent. 


Chapter  19 

Sequences 


We  have  all  seen  lists  of  numbers.  For  example,  we’ve  all  worked  with  a  list  of 
positive  even  integers  presented  in  increasing  order,  (2,4,6, 8, ...  ,2 n,...),  where 

n  =  1,2, 3, 4, _ The  positive  odd  numbers  (1,3, 5, 7, . ..,2n  —  1, . . .)  can  also  be 

presented  in  such  a  list,  where  n=  1, 2, 3, 4, ... .  What  we  are  interested  in  here  is  a 
precise  definition  of  “infinite  list.” 

Here’s  an  example  from  your  childhood  of  a  problem  that  yields  such  a  list.  Let 
n  be  a  positive  integer  with  n  >  2.  Suppose  that  we  have  n  children  arranged  in  a 
circle,  and  that  rather  than  use  their  names,  we  number  them  {1,2, ...  ,n}.  Say  these 
children  want  to  see  who  goes  first  in  a  game.  They  begin  by  eliminating  the  second 
child,  and  then  proceed  around  the  circle,  eliminating  every  other  child  until  there 
is  only  one  child  left.  That  happy  child  goes  first.  The  question  is,  where  should  you 
stand  in  order  to  be  the  winning  child?  Let’s  start  small:  if  there  are  two  children, 
you  should  stand  in  the  first  spot.  If  there  are  three,  you  should  stand  in  the  third 
spot.  If  there  are  four,  you  should  stand  in  the  first  spot.  Where  should  you  stand  if 
there  are  n  children?  (This  challenging  problem  is  known  as  the  Josephus  problem. 
The  answer  appears  at  the  end  of  the  chapter.  Of  course,  the  children  can  count  off 
by  three  or  four,  giving  us  a  new  problem  to  solve.)  In  this  problem,  for  each  group 
of  n  children,  we  have  an  answer.  Thus  we  again  have  a  list  of  numbers.  We  now 
turn  to  the  definition  of  “list.” 

A  sequence  is  a  function  /  from  the  natural  numbers  N  to  a  set  X .  In  this  chapter, 
we  will  concentrate  on  the  case  X  =  R.  It  is  standard  to  write  x„  =  f(n),  and  to  refer 
to  xn  as  a  term  in  the  sequence.  The  sequence  will  be  denoted  (x„)“_0,  or  just 
(x„)  when  it  is  clear  which  n  we  are  referring  to  or  when  it  doesn’t  matter  where 
the  sequence  starts.  We  can  begin  a  sequence  at  an  integer  other  than  n  =  0  when 
convenient,  and  we  will  often  begin  the  sequence  at  n  =  1  without  much  fanfare. 
We’ll  tell  you  where  we  are  starting  when  it  really  matters. 

Since  sequences  are  functions,  we  can  graph  them  as  functions  defined  on  the 
nonnegative  (or  positive)  integers  and  then  we  can  see  what  they  are  doing. 

Example  19.1.  The  sequence  (x„)  is  defined  by  x„  =  1  +  1  /n  for  n  £  Z+.  We  will 
write  out  the  first  few  terms  and  graph  the  beginning  of  the  sequence  in  Figure  19.1. 
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Fig.  19.1  Graph  of  ( xn ),  where  xn  —  1  +  l/n  for  n  >  1 


The  first  four  terms  are 


xt  =2,  x2  =  x3  =  -,  x4  =  -. 

Exercise  19.2.  For  each  of  the  sequences  given  below,  write  out  the  first  four  terms 
and  graph  the  beginning  of  the  sequence. 

(a)  Let  xn  =  1  —  (—  1 )",  for  n  =  0,1,2, _ 

(b)  Let  xn  =  n/(n+  1),  for  n  =  0, 1,2, _ 

(c)  Letx„  =  («2  +  1)/(1  —  n),  for  n  =  2,3, _  O 


When  we  look  at  sequences,  we  notice  that  different  sequences  behave  in  dif¬ 
ferent  ways.  This  is  illustrated  by  our  examples  in  Exercise  19.2.  Some  sequences 
approach  some  horizontal  line  as  n  — >  °°.  Some  seem  to  be  shooting  off  to  infinity, 
others  jump  around  a  lot.  We  need  to  understand  how  these  sequences  differ  from 
each  other,  and  the  following  definitions  will  help  us  do  that. 

You’ll  notice  that  a  lot  of  terms  that  we  introduced  when  we  studied  sets  reappear 
here.  What  is  really  happening  is  that  each  sequence  (x„)  (where  order  counts)  gives 
rise  to  a  nonempty  set  S  =  {x„  :  n  G  N}  (where  order  doesn’t  count).  For  example, 
the  sequence  (z„)  where  zn  =  (—1)”  gives  rise  to  the  set  S  =  {—1, 1},  while  the  se¬ 
quence  (y„)~=1  where  y„  =  n  gives  rise  to  the  set  T  =  {n  :  n  S  Z+}  =  Z+.  The  set  S 
that  is  associated  with  the  sequence  (x„)  is  precisely  the  range  of  the  sequence.  Fur¬ 
thermore,  once  we  understand  this  connection,  we  can  ask  the  same  questions  about 
sequences  that  we  have  asked  about  sets:  For  example,  we  can  ask  when  a  sequence 
is  bounded  above,  bounded  below,  bounded,  has  an  infimum  or  supremum — and  we 
can  use  everything  we  learned  about  sets  to  find  an  answer. 

How  important  are  sequences?  Very  important.  In  fact,  we  believe  that  these  are 
so  important  to  your  future  mathematical  development  that  we  will  restate  all  the 
definitions  in  the  specific  setting  of  sequences.  For  example,  a  sequence  of  real 
numbers  (x„)  is  bounded  if  the  set  S  =  {xn  :  n  L  N }  is  bounded.  Thus,  according  to 
our  definition  of  bounded  set,  a  sequence  of  real  numbers  (x„)  is  bounded  above, 
if  there  is  a  real  number  M  such  that  xn<M  for  all  n,  and  bounded  below,  if  there 
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exists  a  real  number  m  such  that  xn  >  m  for  all  n.  A  sequence  is  bounded  if  it 
is  bounded  above  and  below.  That’s  how  we  defined  it.  But  you  may  find  that  the 
following  provides  a  more  useful  way  to  think  about  boundedness  in  R. 

Exercise  19.3.  Let  (x„)  be  a  sequence.  Prove  that  (xn)  is  bounded  if  and  only  if  there 
exists  a  real  number  N  such  that  \xn  \  <  N  for  all  n.  Q 

Just  as  before,  a  real  number  U  satisfying  xn  <  U  for  all  n  is  called  an  upper 
bound  of  the  sequence  (x„),  and  a  real  number  L  satisfying  L  <  xn  for  all  n  is 
a  lower  bound  of  the  sequence  (x„).  In  our  illustration  above,  we  considered  the 
sequence  (z„),  where  zn  -  (—  1)".  We  see  that  this  sequence  is  bounded  above  (1 
is  an  upper  bound)  and  bounded  below  (—  1  is  a  lower  bound),  and  therefore  it  is 
bounded.  Alternatively,  |z„|  <  1  for  all  n,  and  by  the  previous  exercise  we  may 
conclude  that  the  sequence  ( zn )  is  bounded. 

Exercise  19.4.  (a)  Give  an  example  of  a  real  number  that  is  not  an  upper  bound 

of  the  sequence  given  by  xn  =  n/(n  +  1). 

(b)  Complete  the  following  sentence:  The  real  number  U  is  not  an  upper  bound 
of  the  sequence  ( x„ ),  if  ... .  O 

If  (xn )  is  a  sequence  that  is  bounded  below,  the  set  S  =  {x„  :  n  £  N}  is  bounded 
below.  Since  S  is  a  nonempty  set  of  real  numbers  that  is  bounded  below,  the  in- 
fimum  version  of  the  completeness  axiom  (see  Exercise  12.9)  implies  that  S  has 
an  infimum,  and  we  call  this  the  infimum  of  the  sequence  (x„ )  (or  greatest  lower 
bound  of  (x,,)).  We  denote  it  by  infix,,).  Recall  that  you  showed  (Problem  12.17) 
that  the  infimum  is  unique.  The  definition  of  the  infimum  of  a  sequence,  without 
reference  to  the  set  associated  with  the  sequence,  is  given  in  the  example  below. 

Example  19.5.  Let  (x„)  be  a  sequence  that  is  bounded  below.  State  the  two  proper¬ 
ties  that  the  infimum  of  (xn)  must  satisfy. 

The  infimum  of  the  sequence  (x„ )  is  the  real  number  m  satisfying 

(i)  m  <  x„  for  all  n,  and 

(ii)  if  p  is  a  real  number  satisfying  p  <  x„  for  all  n,  then  p  <  m. 

Each  of  the  statements  (i)  and  (ii)  can  be  stated  in  the  vernacular,  and  you  should 
do  so  now.  O 


Exercise  19.6.  Guess  the  infimum  for  each  of  the  cases  below: 

(a)  xn  =  \/n,  forn  =  1,2,3, .. .; 

(b)  xn  =  «2,  for  n  £  N; 

(c)  x„  =n/(n+  1),  for  n  G  N; 

(d)  xn  =  (— 1)"/(«2  +  1),  for  n  gN. 
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When  you  look  for  the  infimum,  remember  that  it  may  or  may  not  appear  in  the 
sequence.  Everything  we  do  for  the  infimum  can  be  done  for  the  supremum.  So  the 
supremum  of  the  sequence  (x„)  (or  least  upper  bound  of  (xn))  is  the  supremum  of 
the  set  S  =  {xn  :  n  £  N}.  We  denote  it  by  sup(x„).  The  rest  is  left  to  you  in  the  next 
exercise. 

Exercise  19.7.  Let  (x„)  be  a  sequence  that  is  bounded  above.  State  the  two  proper¬ 
ties  that  the  supremum  of  (x„)  must  satisfy,  and  say  why  it  exists.  O 

That  takes  care  of  how  high  and  how  low  a  sequence  can  go.  Now  we  turn  to  how 
it  gets  where  it  is  going. 

A  sequence  (x„)  is  increasing  if  xn  <  xn+\  for  all  n,  and  decreasing  if  x„  >  xn+\ 
for  all  n.  We  say  the  sequence  (x„)  is  strictly  increasing  if  x„  <  x„+i  for  all  n. 
Likewise,  a  sequence  (x„)  is  strictly  decreasing  if  xn  >  x„+i  for  all  n. 

Exercise  19.8.  The  object  of  this  exercise  is  to  make  sure  you  understand  the  defi¬ 
nitions  above.  Either  explain  why  you  cannot  give  an  example  of  the  following,  or 
give  an  example  of 

(a)  a  bounded  sequence.  Lind  an  upper  bound  and  a  lower  bound. 

(b)  a  sequence  that  is  bounded  below,  but  not  bounded  above.  Lind  a  lower  bound. 
Must  the  sequence  be  increasing? 

(c)  a  sequence  that  is  bounded  above,  but  not  bounded  below.  Lind  an  upper 
bound. 

(d)  an  increasing  sequence  that  is  neither  bounded  above  nor  below. 

(e)  a  strictly  increasing  bounded  sequence. 

(f)  a  strictly  decreasing  sequence  that  is  bounded  above,  but  not  below. 

(g)  a  sequence  that  is  neither  strictly  increasing  nor  strictly  decreasing.  O 

Exercise  19.9.  What  is  your  best  guess  for  the  supremum  of  the  sequence 


xn  =  0.999. ..9? 


In  Problem  20.18  of  the  next  chapter  you  will  give  a  rigorous  proof  of  this  fact.  O 

Since  sequences  are  functions,  we  can  manipulate  them  algebraically.  Lor  exam¬ 
ple,  to  add  two  sequences  together,  we  define  the  sum  (x„)  +  (y„)  to  be  the  sequence 
(x„  +>'„).  In  the  same  way,  we  may  subtract  sequences. 

Example  19.10.  Suppose  (x„)  and  (y„)  are  two  bounded  sequences.  If  sup(x„)  =  l 
and  sup(y„)  =  m,  is  sup(x„  +yn)  =  l  + wi? 

We’ll  first  show  that  l  +  m  is  an  upper  bound  of  (x„  +y„).  (Thus  it  makes  sense  to 
talk  about  the  supremum  of  (x„  +yn).)  Then  we  will  try  to  show  the  second  thing: 
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that  l  +  m  is  the  least  of  all  the  upper  bounds,  in  the  sense  defined  above.  Remember 
that  since  we  don’t  know  the  answer  here,  our  attempt  might  fail. 

So  let  /  =  sup(x„)  and  m  =  sup(y„).  Then  x„  <  /  for  all  n  and  y„  <  m  for  all  n. 
Thus  xn  +yn  <  l  +  in  for  all  n.  So  far  so  good;  we  know  that  l  +  m  is  an  upper  bound 
(and  we  know  that  the  sequence  (x„  +  y„ )  is  bounded  above).  But  we  still  have  to 
see  whether  or  not  l+m  is  the  least  upper  bound.  So  suppose  that  p  is  another  upper 
bound.  We  are  supposed  to  show  that  p  >  l+m.  Well,  p  >  xn+yn  for  every  n,  but 
that  doesn’t  seem  to  help  since  that  doesn’t  (in  general)  imply  anything  about  the 
relation  between  p  and  x„  or  p  and  yn.  In  fact,  closer  inspection  reveals  that  if  one  of 
the  sequences  is  negative,  we  can’t  say  anything  at  all.  So  we  abandon  our  attempt 
at  a  proof  and  search  for  a  counterexample,  using  what  we  learned  above. 

Let  (x„)  be  a  bounded  nonconstant  sequence,  say  xi  =  1  and  xn  =2  for  all  other 
n.  Thus  1  =  2.  Now  let  y„  =  —xn.  Then  in  =  —  1.  So  x„  +  y„  =  0,  and  the  supremum 
of  (x„  +yn)  is  clearly  0,  while  l  +  m=  1.  Hence  the  supremum  of  (x„+yn)  need  not 
be  /  +  m.  O 

The  lesson  here  is  that  in  trying  to  prove  something,  we  came  up  with  an  ex¬ 
ample  that  showed  it  wasn’t  true.  This  is  a  perfectly  reasonable  way  to  approach 
the  problem  as  long  as  we  are  always  on  the  lookout  for  what  can  go  wrong  with  a 
proof. 

Exercise  19.11.  Let  ( xn )  be  a  sequence  that  is  bounded  below.  Let  l  =  inf(x„).  Show 
that  (—  x„)  is  bounded  above  and  find  the  supremum  of  the  sequence.  Q 

Recall  that  we  introduced  recursively  defined  functions  in  the  last  chapter:  The 
recursively  defined  function  had  domain  N  and  codomain  X.  Now  X  was  allowed 
to  be  any  nonempty  set,  but  the  domain  had  to  be  N  (or  a  set  of  the  form  {x  G  N  : 
x  >  m}  for  some  m  G  Z).  So,  these  functions,  our  motivating  example,  «!,  and  every 
other  function  to  which  we  applied  the  recursion  theorem  are  all  sequences.  Many 
sequences  are  defined  recursively,  including  several  famous  sequences,  as  we  will 
see  below.  If  a  sequence  is  defined  by  giving  a  formula  for  x„  in  terms  of  n,  then  we 
say  that  the  sequence  is  defined  explicitly.  It  is  possible  for  a  sequence  to  have  both 
a  recursive  definition  as  well  as  an  explicit  one,  as  we  will  see  below. 

Exercise  19.12.  Reconsider  the  function  of  Exercise  18.7,  g  :  N  — >  R,  defined  re¬ 
cursively  by  g(0)  =  1  and  gin  +  1)  =  5 g(n)  for  every  n  G  N.  Then  g  is  a  familiar 
function.  What  is  it?  O 

One  of  the  most  famous  examples  of  a  sequence  defined  recursively  is  the  Fi¬ 
bonacci  sequence.  Fibonacci,  whose  real  name  is  Leonardo  Pisano,  was  born  in 
1 170  in  Pisa.  (One  source  you  might  consult  for  more  information  about  Leonardo 
Pisano  is  L.  Sigler’s  book  [98].)  The  Fibonacci  sequence  is  often  presented  with 
pictures  of  rabbits.  So  here  is  a  version  of  Fibonacci’s  original  rabbit  problem:  Sup¬ 
pose  that  rabbits  live  forever.  Starting  at  the  age  of  two  months,  each  pair  produces 
(exactly)  one  baby  pair,  and  continues  to  do  so  every  month  thereafter.  If  we  start 
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with  one  brand  new  pair,  how  many  pairs  of  rabbits  will  we  have  in  the  /7th  month? 
Now  here’s  the  sequence.  See  if  you  can  figure  out  the  reference  to  these  rabbits. 

Define  h\)  =  0.  F\  =  1 ,  and  Fn+\  =  Fn+Fn- \  for  n  >  1.  This  sequence  is  called 
the  Fibonacci  sequence  and  the  terms  of  the  sequence  are  called  the  Fibonacci 
numbers.  In  this  example,  the  recursion  uses  two  preceding  terms  of  the  sequence. 
This  complicates  the  application  of  the  recursion  theorem.  In  Problem  19.17  we  will 
ask  you  to  provide  the  details  of  this  application  of  the  recursion  theorem. 

Example  19.13.  Find  the  first  seven  terms  of  the  Fibonacci  sequence. 

The  first  seven  Fibonacci  numbers  are:  F$  =  0.  F\  =  1,  Fi  =  1,  Ft,  =  2,  F4  =  3, 
F5  =  5,F6  =  8.  O 

The  Fibonacci  sequence  is  extremely  appealing  to  mathematicians  and  nonmath¬ 
ematicians.  In  fact,  there  are  many  web  sites  and  books  with  information  and  prob¬ 
lems  about  Fibonacci  sequences,  as  well  as  the  journal  The  Fibonacci  Quarterly. 

We  present  one  of  the  many  interesting  patterns  found  in  Fibonacci  numbers 
below.  Others  can  be  found  in  the  problems,  as  well  as  some  of  the  references  given 
in  this  chapter. 

Exercise  19.14.  Let  (Fn)  denote  the  Fibonacci  sequence.  Show  that  for  every  posi¬ 
tive  integer  n  the  equation  Fn+\Fn_  \  —  F„  =  (—1)"  holds. 

Check  the  equation  for  the  first  few  values  of  n  to  see  if  this  is  reasonable  (but, 
of  course,  this  is  not  a  proof).  The  proof  of  Exercise  19.14  will  use  mathematical 
induction,  and  you  can  read  our  solution  below  when  you  are  ready.  Q 

We  now  return  to  the  solution  of  the  Josephus  problem  mentioned  at  the  begin¬ 
ning  of  the  chapter.  For  each  integer  n  >  2,  we  let  f(n  )  denote  the  number  of  the  win¬ 
ning  child.  Then  /( 2)  =  1,  /( 3)  =  3,  f(2n)  =  2 f(n)  —  1,  and  f(2n+  1)  =  2 f(n)  +  1. 
Note  that  a  variation  on  our  “recursion  theorem  theme”  made  this  solution  possible! 
For  more  information  on  the  Josephus  problem,  we  recommend  the  article  [97]. 


Definitions 


Definition  19.1.  A  sequence  is  a  function  /  from  the  natural  numbers  N  to  a  set  X. 
It  is  standard  to  write  xn  =  f(n),  and  to  refer  to  xn  as  a  term  in  the  sequence.  The 
sequence  will  be  denoted  ( x„ ). 

Definition  19.2.  A  sequence  of  real  numbers  (xn)  is  bounded  above,  if  there  is  a 
real  number  M  such  that  xn  <  M  for  all  n.  and  bounded  below,  if  there  exists  a  real 
number  m  such  that  xn  >  m  for  all  n.  A  sequence  is  bounded  if  it  is  bounded  above 
and  below. 

Definition  19.3.  A  real  number  U  satisfying  Xn  <  U  for  all  n  is  called  an  upper 
bound  of  the  sequence  ( xn ),  and  a  real  number  L  satisfying  L  <  x„  for  all  n  is  a 
lower  bound  of  the  sequence  ( x„ ) . 
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Fig.  19.2  Graph  of  ( x„ ),  where  x„  =  1  —  (—1)"  for  n  >  0 


Definition  19.4.  The  infimum  (or  greatest  lower  bound)  of  the  sequence  of  real 
numbers  (x„)  that  is  bounded  below  is  the  real  number  inf(x„)  satisfying 

(i)  inf(x„)  <  xm  for  all  m,  and 

(ii)  if  p  is  a  real  number  satisfying  p  <  xm  for  all  m,  then  p  <  infix,,). 

Definition  19.5.  The  supremum  (or  least  upper  bound)  of  the  sequence  of  real 
numbers  (x„)  that  is  bounded  above  is  the  real  number  sup(x„)  satisfying 

(i)  sup(x„)  >  xm  for  all  m,  and 

(ii)  if  p  is  a  real  number  satisfying  p  >  xm  for  all  m,  then  p  >  sup(x„). 

Definition  19.6.  A  sequence  (x„)  is  increasing  if  x„  <  x„+i  for  all  n.  It  is  strictly 
increasing  if  x„  <  xn+]  for  all  n. 

Definition  19.7.  A  sequence  (x„)  is  decreasing  if  x„  >  x„+i  for  all  n.  It  is  strictly 
decreasing  if  x„  >  xn+\  for  all  n. 

Definition  19.8.  Define  Fo  =  0,Ti  =  1,  and  Fn  =  +F;|_2  for  n  >  2.  This  se¬ 

quence  is  called  the  Fibonacci  sequence  and  the  terms  of  the  sequence  are  called 

the  Fibonacci  numbers. 


Solutions  to  Exercises 

Solution  (19.2). 

(a)  x'o  =  0,  x\  =  2,  X2  =  0,  and  X3  =  2.  The  first  eleven  terms  are  graphed  in 
Figure  19.2. 

(b)  xo  =  0,  xi  =  1  /2,  x'2  =  2/3,  and  X3  =  3/4.  The  first  eleven  terms  are  graphed 
in  Figure  19.3. 


Xn 

1 


U  1  b  1U  n 

Fig.  19.3  Graph  of  (x„),  where  x„  =  n/(n  +  1)  for  n  >  0 
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Fig.  19.4  Graph  of  ( xn ),  where  xn  =  [rr  +  1)/(1  —n)  forn  >  2 

(c)  X2  =  —5,  X3  =  —5,  X4  =  —17/3  «  —5.67,  and  X5  =  —13/2  =  —6.5.  The  first 
nine  terms  are  graphed  in  Figure  19.4. 

Solution  (19.3).  If  you  have  solved  Problem  12.8,  then  you  have  solved  this  exer¬ 
cise  as  well.  If  not,  here  is  a  solution. 

First  we’ll  prove  that  if  (x„)  is  bounded,  then  there  exists  a  real  number  N  such 
that  \x„  \  <  N  for  all  n.  By  the  definition  of  bounded  sequence,  there  exist  real  num¬ 
bers  m  and  M  such  that  m  <  xn  <  M ,  for  all  n.  Hence  —  \m\  <m<xn<M  <  \M\  for 
all  n.  Letting  N  =  max{|mj,  |M|},  we  have  —N  <  xn  <  N .  Thus  \x„\  <N  for  all  n. 

Finally,  we  prove  that  if  there  exists  a  real  number  N  such  that  \x„  \  <  N  tor  all  «, 
then  (x„)  is  bounded.  Since  \x„\  <N  for  all  n,  we  have  —N  <  x„  <  /V  for  all  n.  This 
shows  that  (x„)  is  bounded  below  and  bounded  above.  Hence  the  sequence  (x„)  is 
bounded. 

Solution  (19.4).  Many  answers  are  possible  for  (a). 

(a)  Consider  the  number  m  =  —  1.  Then  m  is  not  an  upper  bound  of  the  sequence 
since  xq  >  m. 

(b)  The  real  number  U  is  not  an  upper  bound  of  the  sequence  (x„)„6n  if  there 
exists  n  £  N  such  that  x„  >  U. 

Solution  (19.6).  The  answers  are:  (a)  0,  (b)  0,  (c)  0,  (d)  —1/2. 

Solution  (19.7).  A  real  number  U  is  the  supremum  of  a  sequence  (x„)  if  (i)  xn<U 
for  all  n,  and  (ii)  if  V  is  another  upper  bound  of  (x„),  then  U  <  V. 

The  set  {x„}  is  bounded  above  and  thus,  by  the  completeness  axiom  of  R,  this  set 
has  a  supremum.  The  supremum  of  this  set  is  the  supremum  of  the  sequence  (x„). 

Solution  (19.8).  You  should  be  able  to  find  examples  for  all  parts  of  this  problem, 
except  part  (d).  An  increasing  sequence  will  always  be  bounded  below,  and  its  first 
term  will  serve  as  a  lower  bound.  For  parts  (a)  and  (g),  the  sequence  ((—1)")  yields 
such  an  example.  An  upper  bound  for  this  sequence  is  10  and  a  lower  bound  is 
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—  10  (of  course  many  other  choices  are  possible).  For  (c)  and  (f),  you  can  use  the 
sequence  (— n),  which  is  bounded  above  by  0  but  is  not  bounded  below.  For  (b),  the 
sequence  defined  by  xn  =  n  +  2(— 1)"  for  n  £  N  is  bounded  below  (by,  for  example, 

—  100)  and  this  sequence  is  not  increasing  (since  xo  =  2  and  x\  =  —1).  Finally,  for 
(e)  the  sequence  (1  —  1  /«)  for  n  £  Z+  serves  as  an  example. 

Solution  (19.9).  Your  guess  was  surely  1 .  Right? 

Solution  (19.11).  Since  /  =  infix,,),  we  know  that  x„  >  l  for  all  n.  Multiplying  both 
sides  by  —1  we  obtain  —x„  <  —I  for  all  «,  and  consequently  (—  xn)  is  bounded 
above  and  —l  is  an  upper  bound.  We  claim  that  — /  is  the  supremum,  too.  Suppose 
that  m  is  also  an  upper  bound.  Then  -xn  <  m  for  all  n.  Multiplying  by  —1,  we  see 
that  xn  >  —m  for  all  n.  But  this  implies  that  —m  is  a  lower  bound  for  (x„).  Since 
l  =  infix,,),  we  know  that  —  m  <  1.  Thus  m  >  — and  —l  is  the  least  of  all  the  upper 
bounds.  So  —l  =  sup(—  x„). 

Solution  (19.12).  We  note  that  g(0)  =  l,g(l)  =  5,g(2)  =  52,  and  g( 3)  =  53.  We 
guess  that  g(n)  =  5"  for  all  n  £  N.  We  prove  this  guess  using  induction. 

For  the  base  step  of  n  =  0  we  have  g(0)  =  1=5°. 

Now  for  the  induction  step,  we  let  n  £  N  and  we  suppose  that  g(n)  =  5".  Then 
g(n  +  1)  =  5 g{n)  by  the  recursive  definition.  Using  the  induction  hypothesis  we  get 
5(g(n))  =  5(5").  We  conclude  that  g(n  +  1)  =  5”+1,  completing  the  induction  step. 
By  mathematical  induction  the  function  is  g(n)  =  5",  as  we  guessed  above. 

Solution  (19.14).  We  will  prove  the  validity  of  this  equation  using  induction.  We 
will  consider  two  base  steps  here.  For  n  =  1,  we  easily  check  that  FjFq  —  F2  =  —  1 . 
Similarly,  for  n  =  2,  we  can  check  that  F3F1  —  F22  =  1.  For  the  induction  step,  let 
n  £  N  with  n>2  and  suppose  that  F,1+iF„_i  —  F^  =  (—1)".  In  other  words,  we 
assume  that  F2  =  F„+iF„_i  —  (—1)".  We  will  show  that  Fn+2Fn  —  Fn2+1  =  (— 1)"+1. 
To  see  this,  note  that  Fn+ 2  =  Fn+\  +F„  for  all  n.  Thus  (you  should  fill  in  reasons  for 
each  of  the  equalities): 


F„+2Fn-F^+l  =  (F„+i+F„)F„-F„2+1 

=  Fn+iF„  +  Fn  —  Fn+l . 

Use  the  induction  hypothesis  to  replace  the  middle  term,  F,2,  in  the  summand  above 
to  conclude  that 

Fn+2Fn—F~+ 1  =  F„+iF„  +F„+iF„_i  -  (-1)"  —  F„2+1 
=  Fn+l(Fn  +  Fn-X)  -F2+1  +  (-1)"+1 
=  (-1)"+1, 


and  the  result  now  follows  from  the  principle  of  mathematical  induction. 
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The  first  100  Fibonacci  numbers  can  be  found  on  the  Web  [60].  From  there  you 
can  get  to  a  very  cute  picture  of  little  rabbits,  as  well  as  a  set  of  puzzles  based  on 
these  numbers.  Fibonacci  numbers  also  make  an  appearance  in  the  popular  chil¬ 
dren’s  book  The  Number  Devil  by  Hans  Magnus  Enzensberger  [25]. 


Problems 

Problem  19.1.  Graph  the  following  sequences  and  briefly  describe  each  of  the 
graphs: 

(a)  xn  =  (—1)",  where  n  G  N; 

(b)  x„  =  1/2",  where  n  G  N; 

(c)  xn  =  n/(n  —  1),  where  n  G  N  and  n  >  2; 

(d)  xn  =  (— 1)"/2",  where  n  G  N; 

(e)  xn  =  (— 1  )"(u2/(n  +  1)),  where  n  G  N. 

Problem  19.2.  Prove  that  the  sequence  (xn ) ,  defined  by  xn  =  jjjj,  is  strictly  increas¬ 
ing. 

Problem  19.3.  In  what  follows,  no  rigorous  proofs  are  required.  However,  you 
should  provide  a  brief  explanation  of  your  answers. 

(a)  Give  an  example  of  a  sequence  of  rational  numbers  that  is  bounded  above. 

(b)  Give  an  example  of  a  sequence  of  rational  numbers  that  has  no  upper  bound, 
but  does  have  a  lower  bound. 

(c)  Give  an  example  of  a  strictly  increasing  sequence  of  numbers  that  has  a  supre- 
mum,  but  such  that  the  supremum  is  not  a  term  in  the  sequence.  Can  you  find 
a  strictly  increasing  sequence  (x„)  such  that  the  supremum  is  equal  to  xn  for 
some  u?  Why  or  why  not? 

Problem  19.4.  The  game  of  chess  seems  to  have  some  of  its  roots  in  India.  As  the 
story  goes,  the  emperor  of  India  was  smitten  with  the  new  game,  and  he  asked 
the  inventor  of  chess  how  he  could  reward  him  for  this  marvelous  invention.  The 
(apparently  modest)  reply  was  to  ask  for  one  grain  of  rice  for  the  first  square  of  the 
chess  board,  double  that  for  the  second,  double  that  for  the  third,  and  so  on.  Thus, 
for  each  square  after  the  first,  the  inventor  receives  double  the  number  of  grains  he 
received  on  the  previous  square. 

Let  xn  =  number  of  rice  grains  for  the  nth  square  of  the  chessboard.  Then  (xn )  is 
a  sequence. 

(a)  Find  the  first  four  terms  of  (xn). 

(b)  Find  an  explicit  formula  for  x„. 

Consider  the  new  sequence  ( a„ ),  where  an  =  total  number  of  rice  grains  for  the  first 
n  squares.  You  will  need  to  use  the  formula  for  a  geometric  sum:  For  a  real  number 

a  ^  1  and  positive  integer  n  we  have  1  +  a  H - ha"  =  (1—  a"+1 ) /( 1  —  a) .  (If  this 

formula  is  unfamiliar  to  you,  prove  it!) 
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(c)  Find  the  first  four  terms  of  (a„). 

(d)  Find  an  explicit  formula  for  a„. 

(e)  What  is  the  mass  of  the  rice  the  inventor  asked  for?  (Use  the  fact  that  the  mass 
of  one  grain  of  rice  is  approximately  25  mg.) 

(f)  The  rice  production  of  the  whole  world  in  2004  was  approximately  600  mil¬ 
lion  metric  tons  of  rice.  How  does  this  compare  with  the  inventor’s  request? 

Problem  19.5.  Phenytoin  is  a  medication  designed  to  control  seizures.  It  is  admin¬ 
istered  twice  daily  at  8:00  a.m.  and  at  8:00  p.m.  The  dosage  at  each  administration 
is  75  mg.  The  retention  rate  after  12  hours  is  65%  (that  is,  12  hours  after  intake, 
65%  of  the  drug  remains  in  the  body).  Let  S„  denote  the  amount  of  phenytoin  in  the 
patient’s  body  shortly  after  the  nth  administration  of  the  drug. 

(a)  Calculate  the  first  four  terms  of  the  sequence  (S„)”=1. 

(b)  Give  a  recursive  definition  of  this  sequence. 

(c)  Prove  that  (Sn)  is  strictly  increasing,  bounded,  and  that  sup(5’„)  exists. 

We  let  sn  denote  the  amount  of  phenytoin  in  the  patient’s  body  shortly  before  the 
nth  administration  of  the  drug. 

(d)  Express  s„  in  terms  of  S„. 

(e)  Indicate  why  (s„)  is  also  increasing,  bounded,  and  sup(s„)  exists. 

(f)  Find  the  minimum  and  maximum  amount  of  phenytoin  in  the  patient’s  blood 
following  the  8  a.m.  dose  on  day  3  and  prior  to  the  8  a.m.  dose  on  day  4. 

Problem  19.6.  Give  an  example  of  a  sequence  of  rational  numbers  that  has  an  irra¬ 
tional  number  as  supremum.  Prove  your  claim! 

Problem  19.7.  We  define  a  sequence  (a„)  by  ci\  =  1  and  an+\  =  3  —  1  / an  for  all 
n>  1. 

(a)  Show  that  (an)  is  bounded. 

(b)  Prove  that  [an)  is  strictly  increasing. 

Problem  19.8.  We  define  the  sequence  (x„)  by  xn  =  for  n  £  N. 

(a)  Prove  that  ( x„ )  is  bounded. 

(b)  Is  ( x„ )  increasing?  Prove  it  or  give  a  counterexample. 

(c)  Find  inf(x„). 

Problem  19.9.  If  /  =  sup(x„),  what  is  inf(— x„)?  (You  should  know  by  now  that  the 
first  thing  to  do  is  to  try  examples.  Make  up  at  least  three  different  examples.)  State 
your  conjecture.  Prove  it. 

Problem  19.10.  If  /  =  sup(x„),  what  is  sup(kx„)  where  k  £  K 1  ?  Prove  your  conjec¬ 
ture. 

Problem  19.11.  Let  (x„)  be  a  bounded  sequence  such  that  x„  <  —2  for  all  n£  N. 
(a)  Prove  that  (x^)  is  bounded. 
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(b)  Let  l  =  inf(x„)  and  m  =  sup(x„).  Find  inf(x^)  in  terms  of  /  or  m,  or  both.  Prove 
that  your  result  is  correct. 

Problem  19.12.  Suppose  that  (x„)  and  (y„)  are  bounded  below. 

(a)  Show  that  inf(x„  +  y„)  >  inf(x„)  +  inf(v„). 

(b)  Is  it  always  true  that  inf(x„  +yn)  =  inf(x„)  +inf(y„)?  Prove  this  or  give  a 
counterexample. 

Problem  19.13.  Suppose  that  (x„)  and  (y„ )  are  bounded  below.  Is  it  always  true  that 
inf(x„yn)  =  inf(x„)  inf(y„)?  Prove  this  or  give  a  counterexample. 

Problem  19.14.  Let  (xn )  and  (y„ )  be  two  sequences  of  real  numbers.  Assume  that 
( yn )  is  bounded  above  and  that  x„  <  yn  for  all  n  £  N. 

(a)  Prove  that  (x„ )  is  also  bounded  above. 

(b)  Prove  that  sup(x„)  <  sup(y„). 

(c)  Do  the  assumptions  imply  that  sup(x„)  <  sup(y„)?  If  yes,  prove  it;  if  no,  find 
a  counterexample. 

Problem  19.15.  Let  (x„)  and  (y„ )  be  two  sequences  of  real  numbers.  Assume  that 
(x„)  is  bounded  above  and  that  x„  <  yn  <  x„+i  for  all  n  £  N. 

(a)  What  can  you  say  about  sup(x„)  and  sup(y„)?  Must  they  exist?  If  so,  how  do 
they  compare? 

(b)  What  can  you  say  about  inf(x„)  and  inf(y„)?  Must  they  exist?  If  so,  how  do 
they  compare? 

Problem  19.16.  We  define  a  sequence  (x„)  recursively  by  letting  xo  =  1000  and 
x„  =  (.05)x„-i  for  n  >  1.  Find  another  representation  for  this  sequence.  Have  you 
seen  this  anywhere  else  before?  If  so,  where? 

Problem  19.17.  With  the  help  of  the  recursion  theorem,  prove  that  the  Fibonacci 
sequence  (see  Definition  19.8)  is  well-defined;  that  is,  specify  a,  X,  and  /  appearing 
in  the  recursion  theorem.  (You  can  use  Example  18.8  to  suggest  an  approach  to  this 
problem.) 

Problem  19.18.  (a)  Prove  that  the  Fibonacci  sequence  is  increasing. 

(b)  Prove  that  the  Fibonacci  sequence  is  unbounded. 

Problem*  19.19.  Let  (Fn)  be  the  Fibonacci  sequence  and  x„  =  Fn+i/Fn,  for  n  >  1. 
Show  that  x„  =  1  +  1  /x„_i ,  for  n  >  2. 

Problem  19.20.  Prove  the  following  explicit  formula  for  the  nth  Fibonacci  number: 


_  a "  -  bn 
a  —  b 


where  a  = 


1-V5 


2 


Problem  19.21.  Let  /( 0)  =  2,  /(l)  =  2,  and  define  f(n  +  1)  =  f(n)f{n  —  1).  Find 
an  explicit  formula  for  f{n). 
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Problem  19.22.  Let  (Fn)  denote  the  Fibonacci  sequence.  Define  the  Lucas  sequence 
by  Lq  =  2,  L\  =  1,  and  for  n  >  1  define  L„+i  =  Ln  +L„_  i.  (For  some  proofs  you 
may  want  to  use  the  second  principle  of  induction  stated  in  the  problem  section  of 
Chapter  18  as  Theorem  18.9.) 

(a)  Calculate  Li, ...  ,Lio- 

(b)  Calculate  L„  —Fn-\,  for  n  >  1.  Find  a  remarkable  pattern  in  this  list  of  num¬ 
bers.  State  it  clearly  and  prove  it  by  induction. 

(c)  Calculate  Fn  +Ln.  Find  a  remarkable  pattern  in  this  list  of  numbers.  State  it 
clearly  and  prove  it  using  part  (b). 


Chapter  20 

Convergence  of  Sequences  of  Real  Numbers 


As  we  saw  in  the  last  chapter,  when  we  graph  several  terms  of  a  sequence,  certain 
behavior  may  appear.  We  may  become  convinced,  for  whatever  reason,  that  the 
sequence  is  unbounded.  Or,  we  may  believe  that  the  sequence  is  bounded  and  we 
may  even  notice  the  sequence  moving  toward  a  particular  horizontal  line.  But  how 
do  we  check  that  what  we  believe  is  happening  really  is  happening? 

Our  efforts  to  explain  this  require  that  you  fully  understand  how  to  measure  dis¬ 
tance.  So  we  remind  you  that  distance  is  usually  measured  using  the  absolute  value 
function,  or  |jc|,  and  the  absolute  value  of  a  real  number  x  measures  the  distance 
from  x  to  0.  If  we  want  to  measure  the  distance  between  two  real  numbers  x  and  a, 
we  would  need  to  look  at  |x  —  a\ . 

For  an  arbitrary  positive  real  number  e,  we  know  what  it  means  to  say  |x|  =  £. 
What  does  it  mean  to  say  |x|  <  £?  The  answer  is,  as  you  can  check,  that  |x|  <  £  if 
and  only  if  — £  <  x  <  £.  So  how  do  we  determine  when  a  sequence  of  real  numbers 
approaches  a  real  number  LI  We  will  use  the  absolute  value  function  to  measure 
the  distance  from  terms  in  the  sequence  to  L.  We  make  this  precise  in  the  following 
definition. 

We  say  that  a  sequence  (xn)  converges  if  there  exists  a  real  number  L  such  that 
for  all  £  >  0  there  exists  a  real  number  N  such  that  \x„  —  L\  <  £  for  all  n  >  N.  If  such 
an  L  exists,  we  call  L  the  limit  of  the  sequence  (x„),  we  say  that  (x„ )  converges  to  L, 
and  we  write  x„  — >  L  or  I i rn„  x„  =  L.  If  no  such  L  exists,  we  say  that  the  sequence 
diverges.  While  we  allow  A  to  be  a  real  number,  we  caution  you  to  remember  that 
the  indices  on  the  terms  of  a  sequence,  x„,  are  natural  numbers.  To  really  understand 
this  definition,  we  must  understand  it  visually,  and  we  must  also  know  how  to  use  it 
to  show  that  a  sequence  converges.  We  first  turn  to  the  visual  aspect  of  convergence. 

Let’s  think  about  the  definition.  If  we  believe  the  sequence  (x„)  converges,  then 
we  need  to  find  a  real  number  L  such  that  the  sequence  gets  really  really  close  to  the 
line  y  =  L.  Mathematically,  we  say  that  we  need  the  sequence  arbitrarily  close  to  the 
horizontal  line.  This,  in  turn,  implies  that  the  distance  from  the  sequence  to  the  line 
y  =  L  should  be  less  than  every  positive  real  number  £  that  we  can  think  of;  that  is 
what  arbitrarily  close  means.  But  it  may  not  happen  right  away;  it  may  only  happen 
eventually  (for  each  £,  there  will  exist  N  such  that  x„  may  not  satisfy  what  we  want 
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Fig.  20.1  Definition  of  a  convergent  sequence;  two-dimensional  illustration 


for  n  <  N,  but  it  will  satisfy  it  for  n  >  N).  That’s  why  we  defined  convergence  the 
way  we  did. 

We  illustrate  this  definition  in  Figure  20.1.  In  this  picture,  we  first  pick  a  value 
£  =  £i  >0.  Then  we  indicate  a  corresponding  real  number  N  =  N\  such  that  for 
all  n  >  N ]  we  have  \xn  —  L\  <  £] .  Looking  at  the  figure,  we  see  that  the  strip  from 
y  =  L  —  E\  to  y  =  L  +  £  |  contains  all  the  terms  of  the  sequence  from  X5  on.  Generally, 
the  smaller  the  value  of  £,  the  larger  the  value  of  N.  Let’s  think  about  why  this  is 
true:  Returning  to  Figure  20.1,  we  see  that  £  =  £2  is  smaller  than  £1,  and  this,  in 
turn,  forces  the  sequence  to  be  closer  to  the  horizontal  line.  So  we  go  farther  out  in 
the  sequence  to  get  closer  to  the  line  y  =  L.  You  also  probably  noticed  that  once  we 
find  a  value  of  N  that  works,  anything  larger  than  N  will  work,  too. 

Figure  20.2  below  is  yet  another  way  to  illustrate  the  same  situation.  Explain  this 
sketch  to  yourself. 

Now  let’s  turn  to  how  we  show  a  sequence  converges.  First  we  make  a  conjecture 
as  to  the  value  of  the  limit,  call  it  L.  The  important  thing  to  notice,  when  we  make 
our  conjecture,  is  that  we  are  interested  in  the  behavior  as  n  — >  We  don’t  really 
care  what  happens  for  small  n,  for  example.  So  if  we  ask  what  a  sequence  converges 
to,  you  can  guess  by  ignoring  terms  that  don’t  really  matter  in  the  long  run.  For 
example,  if  we  ask  you  to  guess  what 

/  2«2  +  3«+4\ 

\4«2  —  3«  —  5  / 


Xl 


x7 


x5 


L -£1  L— £2  L  L+£2  L+£i 


Xo 


Fig.  20.2  Definition  of  a  convergent  sequence;  one-dimensional  illustration 
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converges  to,  you  would  remember  that  only  behavior  near  infinity  matters,  so  you 
would  probably  guess  that  2 n2  really  dominates  the  quantity  in  the  numerator,  while 
4 n2  really  dominates  the  quantity  in  the  denominator.  Thus,  when  you  guess  this 
limit,  you’ll  probably  guess  that  it’s  the  same  as  the  limit  of  the  sequence  with  terms 
xn  =  2n2/(4n2) — which  it  is.  You  probably  also  would  guess  that  this  limit  is  1/2, 
and  you’d  be  right  again.  Before  we  move  to  our  second  step,  try  practicing  your 
guessing  on  some  of  the  examples  below. 

Exercise  20.1.  Guess  the  limits  of  each  of  the  following  sequences: 

3n2+n  +  4\ 

3  +  n  +  5n2J’ 

<->  (^> 


Now,  once  you  have  guessed  your  limit,  you  must  prove  that  it  is  correct.  To  do 
this,  assume  that  we  are  given  an  arbitrarily  small  number,  which  we  denote  by  £. 
We  then  try  to  find  a  real  number  N  (depending  on  e)  so  that  for  the  remaining  terms 
of  the  sequence,  that  is,  for  n  >  N,  we  have  \xn  —  L\  <  £.  This  is  where  things  get 
tricky. 

Let’s  see  how  we  would  use  this  definition,  starting  with  a  fairly  simple  example. 
Example  20.2.  Show  that  the  sequence  [xn)  defined  by  xn  =  l/n  converges  to  0. 

We’ll  find  this  easier  if  we  follow  Polya’s  list. 

“Understanding  the  problem."  We  need  to  show  that  for  every  £  >  0  there  exists 
a  real  number  N  such  that  n  >N  implies  that  \x„  —  0|  <  £.  We  fill  in  what  we  can, 
remembering  that  £  is  arbitrary;  that  is,  it  is  chosen  for  us  and  we  have  no  control 
over  its  value.  So  once  someone  gives  us  £,  we  are  supposed  to  come  up  with  N  so 
that  |  l/n  —  0|  <  £  for  n  >  N. 

“Devising  a  plan."  Now  we’ll  work  backwards  to  see  what  N  has  to  be.  We  need 
\\/n\  <  £.  So  we  need  n  >  1/e.  It  appears  that  if  we  take  N  >  1/e,  say  N  =  2/e, 
we’ll  get  exactly  what  we  need  when  n>N.  Since  N  depends  on  £  (there’s  an  £  in 
our  definition  of  N),  many  authors  write  N  =  N(e). 

Before  we  carry  out  the  plan,  note  that  if  we  knew  £,  then  this  would  tell  us 
exactly  how  big  N  has  to  be.  If  £  =  0.1,  then  N  needs  to  be  bigger  than  1/0.1  =  10; 
if  £  =  0.01,  then  N  needs  to  be  bigger  than  1  /0.01  =  100;  and  if  £  just  happens  to 
be  itself,  then  N  needs  to  be  greater  than  1/e.  We’re  now  ready  to  carry  out  the  plan. 
We  have  to  write  this  up,  so  that  a  reader  who  has  not  seen  our  work  will  know  what 
we  are  doing. 

Proof.  If  £  >  0  and  we  choose  N  =  2/e,  then  for  n  >  N,  we  have  \xn  —  0|  =  |  l/n  —  Oj 
=  \/n<\  /N  =  e/2  <  £.  Therefore,  if  n  >  N,  then  \xn  —  0|  <  £  as  desired.  □ 
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Our  second  example  is  a  bit  more  challenging,  and  requires  slightly  different 
techniques.  You’ll  see  more  problems  of  this  type  and  more  challenging  limit  prob¬ 
lems  when  you  take  your  first  analysis  course. 

Example  20.3.  Show  that  limn_»oo«/(n  +  2)  =  1. 

As  above,  we’ll  first  understand  our  problem,  and  then  devise  a  plan. 

“Understanding  the  problem.”  We  start  by  writing  out  the  definition:  For  all  e  >  0 
there  exists  N  such  that  n>N  implies  that  \x„  —L\  <  £. 

Next  we’ll  fill  in  xn  and  L:  For  all  e  >  0  there  exists  N  such  that  n>  N  implies 
that  |n/(n  +  2)  —  1|  <  £. 

Now  simplify  the  expression  |n/(n  +  2)  —  1 1  <  e  to  find  out  how  big  n  must  be  in 
terms  of  e:  We  need  to  find  N  so  that  n>N  implies  that  2/(n  +  2)  <£. 

“Devising  a  plan.”  You  can  solve  for  n,  as  above,  if  you  wish,  but  that  method 
only  works  well  in  simple  cases.  So  we  are  going  to  try  to  change  this  problem  from 
the  one  we  have  to  a  simpler  problem.  How  will  we  do  that?  Well,  we  want  to  make 
2/{n  +  2)  small.  If  we  can  find  something  bigger  and  simpler  than  2/{n  +  2),  and  if 
we  can  make  that  less  than  e,  then  we  will  also  know  that  2/(n  +  2)  is  less  than  e. 
What’s  bigger  and  simpler?  The  previous  example  wasn’t  too  bad.  So  if  we  just  had 
a  simple  fraction,  we  would  be  in  good  shape.  A  general  strategy  that  often  works  is 
this:  If  the  numerator  is  complicated,  we’ll  try  to  find  something  simpler  and  larger 
than  the  numerator.  If  the  denominator  is  complicated,  we’ll  try  to  find  something 
simpler  and  smaller  than  the  denominator.  Our  simpler  expression  must  still  “act  the 
same  in  the  long  run”  as  the  original. 

For  this  exercise,  the  numerator  is  simple,  so  we’ll  leave  it  alone.  For  the  denom¬ 
inator,  we’ll  try  to  somehow  use  the  thing  that  dominates:  n.  We  need  n+2  greater 
than  or  equal  to  something  simple  involving  n.  It’s  pretty  clear  that  n  +  2  >  n,  so 
we’ll  use  n.  Putting  this  together,  we  have  found  that 

2  2 

- <  -. 

n  +  2  n 

Thus,  if  we  make  2/n  <  £,  we  will  also  make  2 /{n  +  2)  <  e.  But  making  2/n  <  e 
is  easy,  since  2/n  <  e  if  and  only  if  n  >  2/e.  Therefore,  it  appears  that  if  N  >  2/e, 
then  2/n  <  e  and  thus  2/  (n  +  2)  <  e,  which  is  what  we  need. 

“Carrying  out  the  plan.”  Write  out  the  proof,  beginning  with  “If  e  >  0  and _ ” 

The  very  next  phrase  should  identify  N,  unless  there  are  things  you  need  to  tell  the 
reader  in  order  for  the  reader  to  understand  your  definition  of  the  real  number  N. 
Remember  that  the  reader  will  only  see  your  proof,  not  your  plan. 

Proof.  If  e  >  0  and  we  choose  N  =  3/e,  then  for  n>N,  we  have 


n  +  2 


2 

n  +  2 


2  2  2e 

<  -  <  —  =  — 
~  n~  N  3 


<  £. 


Thus,  lim,,^.,*,  n/(n  +  2)  =  1 . 


□ 


Here’s  one  more  exercise  on  limits. 
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Exercise  20.4.  The  sequence  ((2n  +  4)/(n2  +  n+  1))  converges.  Guess  the  limit 
and  prove  that  your  guess  is  correct,  using  the  definition  of  convergence.  Q 


It’s  time  to  think  about  negating  the  definition  of  convergence. 

Exercise  20.5.  By  negating  the  definition  of  convergence,  explicitly  state  what  it 
means  for  a  sequence  (x„)  to  diverge.  Q 

Exercise  20.6.  Using  Exercise  20.5,  show  that  the  sequence  ((—1)")  diverges.  O 

We  state  the  most  basic  properties  of  limits  and  convergent  sequences  here.  The 
first  theorem  says  that  there  can  be  one  and  only  one  choice  for  the  limit  of  a  con¬ 
vergent  sequence. 

Theorem  20.7.  If  a  sequence  converges,  then  the  limit  is  unique. 

We’ve  done  uniqueness  proofs  before  and  we’ll  do  this  one  the  same  way:  We 
suppose  to  the  contrary  that  there  are  two  different  limits  L  and  M,  and  then  we  will 
show  that  they  must  be  the  same.  So,  we  need  to  show  that  L  —  M  =  0.  We  also  use 
a  standard  trick:  we  add  and  subtract  the  same  quantity  to  an  object.  Why?  Well, 
since  all  we  know  is  that  L  and  M  get  close  to  the  terms  of  the  sequence  (x„),  we 
have  to  somehow  use  these  terms.  But  there  is  no  x„  in  the  equation  L  —  M  =  0.  So 
we  will  have  to  insert  an  xn  where  none  appears. 

Proof.  Let  (x„)  be  a  convergent  sequence.  Suppose  to  the  contrary  that  xn  — >  L  and 
x„  — >  M,  where  L  M.  Let  £  =  (1/4)|L  —  M\.  Then  e  >  0.  By  the  definition  of 
convergence,  since  e  >  0,  there  exists  N\  such  that  \xn  —L\  <  £  for  n  >  Ni  and  there 
exists  N2  such  that  |x„  —  M\  <  £  for  n  >  N2.  Let  N  =  max{?Vi , /V2}.  Then  for  n>N 
we  have 


\L  —  M\  =  \L  —  xn+xn—M\ 

<  \L  —  x„\  +  \xn  —  M\  (by  the  triangle  inequality) 

<  e  +  £  (since  n  >  N\  and  n  >  N2) 

=  \\L-M\. 

But  this  is  silly,  since  no  positive  real  number  is  smaller  than  half  of  itself.  This 
contradiction  establishes  the  result  that  limits  of  sequences  are  unique.  □ 

Here’s  another  important  theorem.  It  uses  Exercise  19.3,  which  says  that  a  se¬ 
quence  (x„)  is  bounded  if  and  only  if  there  exists  a  real  number  M  such  that  \xn  \  <  M 
for  all  n. 

Theorem  20.8.  Every  convergent  sequence  is  bounded. 

Proof.  Suppose  that  the  sequence  (x„)“=1  converges  to  the  real  number  L.  Let  £  =  1 . 
Then  there  exists  N  such  that  \xn  —  L\  <  1  for  all  n  >  N.  Let  K  be  the  small¬ 
est  integer  satisfying  K  >  N.  Thus  \xn\  =  \xn  —  L  +  L\  <  \xn  —  L\  +  \L\  <  1  +  \L\ 
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for  all  n  >  K.  Consider  the  real  numbers  \x\  |,  jxil,.  • . ,  |xx-_i|  and  1  +  \L\.  Since 
there  are  finitely  many  such  numbers,  we  may  choose  the  maximum  of  these.  Let 
M  =  max{  I*!  | ,  |jc2  | , . . . ,  \xk - t 1 , 1  +  \L\ }.  Then  |x„  |  <  M  for  all  n,  and  we  conclude 
that  the  sequence  (x„)  is  bounded.  □ 

Part  (i)  of  the  next  theorem  says  that  if  we  sum  two  convergent  sequences,  the 
new  sequence  converges,  too.  It  also  says  “the  limit  of  the  sum  is  the  sum  of  the 
limits.”  What  do  the  other  parts  say? 

Theorem  20.9.  Let  (x„ )  and  (y„)  be  two  sequences  that  converge.  Let  L  and  M  be 
real  numbers  such  that  x„  — >  L  and  y„  — ■>  M.  Then 

(i)  x„  +yn  L  +  M, 

( ii)  axn  — >  aL,  for  every  real  number  CL, 

(iii)  xnyn  — *■  LM,  and 

(iv)  ifM  ^  0  and  y„  ^  Ofor  all  n,  then  1  /yn  — ►  1  /M. 

We  prove  part  (i)  here.  All  the  proofs  are  similar,  and  this  part  will  illustrate 
the  most  important  idea,  which  is  that  we  need  to  choose  things  carefully  to  make 
everything  work  out.  Here’s  what  we  mean:  For  every  e  >  0,  we  need  to  find  N  such 
that  for  n  >N  we  have  |(x„  +yn )  —  (L  +  M)  \  <  e.  We  can  make  |x„  —  L\  <  £  for  n 
large  enough,  and  we  can  make  \yn  —  M\  <  e  for  n  large  enough,  but  if  we  add  these 
together  we  get  2e.  You’ll  now  see  how  we  handle  this  problem. 

Proof.  [Proof  of  (i)]  Let  e  >  0.  Since  e/2  >  0  and  xn  — >  L,  there  exists  /V|  such  that 
|x„  L\  <  e/2  for  n>  N  i .  Again,  since  e/2  >  0  and  yn  —*  M,  there  exists  N2  such 
that  | yn  —M j  <  e/2  for  n  >  Ni-  Let  N  =  max{Ah,?V2}.  Then  for  n  >  N,  we  know 
that  n  >  N\  and  «  >  N2,  so 

|  (.Xn  T  yn)  —  {L  +  M)  I 

=  \{xn-L)  +  {yn-M)\ 

<  \xn  —  L\  +  \yn  —  M\  (by  the  triangle  inequality) 

<e/2  +  e/2  (since  n>  N\  and  n  >  Nf). 

We  conclude  that  for  all  e  >  0  there  exists  N  such  that  for  all  n  >  N  we  have 
| (x„  +yn)  —  {L  +  M) |  <  e,  as  desired.  □ 

This  theorem  can  be  used  to  find  the  limit  of  a  recursively  defined  function  rather 
easily,  if  we  know  that  the  sequence  converges. 

Exercise  20.10.  Define  the  sequence  (x„)  by  x\  =  1  and  x„+i  =  0.0 2x/  +  8  for  all 
n  £  Z+.  You  may  assume  that  the  sequence  (x„)  converges  and  is  bounded  above  by 
20.  Find  lim„  ,rx,x„.  Q 
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Definitions 

Definition  20.1.  We  say  that  a  sequence  (x„)  converges  if  there  exists  a  real  number 
L  such  that  for  all  e  >  0  there  exists  a  real  number  N  such  that  \x„  —  L\  <  e  for  all 
n  >  N.  If  such  an  L  exists,  we  call  L  the  limit  of  the  sequence  (x„),  we  say  that  (x„) 
converges  to  L,  and  we  write  x„  — >  L  or  \\mn-,„xn  =  L. 

Definition  20.2.  A  sequence  diverges  if  it  does  not  converge. 

Definition  20.3  (for  Problem  20.20).  Let  (x„)  be  a  bounded  sequence.  For  each 
positive  integer  n,  let  sn  =  sup{xm  :  m  >  «}.  Then  (s„)  converges  and  its  limit  is 
called  the  limit  superior  of  (x„),  written  limsup(x„). 

Definition  20.4  (for  Problem  20.20).  Let  (x„)  be  a  bounded  sequence.  For  each 
positive  integer  n,  let  t„  =  inf{xm  :  m  >  n}.  Then  (t„)  converges  and  its  limit  is 
called  the  limit  inferior  of  (x„),  written  liminf(x„). 

Definition  20.5  (for  Problem  20.21).  A  sequence  (x„)  of  real  numbers  diverges  to 

infinity  (written  x„  — >  °°)  if  for  every  MeR  there  exists  a  real  number  N  such  that 
n>N  implies  x„  >  M. 

Definition  20.6  (for  Problem  20.21).  A  sequence  is  monotone  if  it  is  an  increasing 
sequence  or  a  decreasing  sequence. 

Definition  20.7  (for  Problem  20.21).  A  sequence  (x„)  is  a  Cauchy  sequence  if  for 

all  e  >  0  there  exists  a  real  number  A  such  that  n,m  >  N  implies  that  |x„  —  xm\  <  £. 

Definition  20.8  (for  Problem  20.22).  Let  (x„)“=1  be  a  sequence  of  real  numbers 
and  let  (n^)  be  a  strictly  increasing  sequence  of  positive  integers.  The  sequence 
(x„k)  is  called  a  subsequence  of  (x„ ) . 


Solutions  to  Exercises 

Solution  (20.1).  The  answers  are  (in  this  order):  3/5,  0,  and  —1. 

Solution  (20.4).  Normally  when  we  write  up  our  solution  we  will  include  the  proof 
and  not  our  work  on  devising  a  plan.  But  one  more  careful  example  here  will  cer¬ 
tainly  be  useful.  So  here’s  our  plan:  We  guess  that  this  converges  to  0,  so  we  need 
to  show  that  for  all  e  >  0,  there  exists  N  such  that  |  (2 n  +  4)  /  (n2  +  n  +  1 )  |  <  e  for 
all  n  >  N.  Now  both  numerator  and  denominator  are  a  bit  complicated.  For  the  de¬ 
nominator,  we  need  to  find  something  smaller  and  simpler  than  n2  +n+  1  involving 
the  highest-order  term  n2.  So  for  this  part,  we  note  that  n2+n  +  1  >  n 2 .  Now  for  the 
numerator,  we  need  to  find  something  larger  and  simpler  than  2«  +  4  involving  the 
highest-order  term  n.  For  n>  1,  since  4  <  An,  we  see  that  2n  +  4  <  2n  +  An  =  6 n. 
Putting  this  together,  for  n  >  1,  we  have 


230 


2n  +  4 

|2  4-  f?  4- 
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n2  +  n+  1 


6n  6 

<-2  =-■ 
n z  « 


So,  if  we  make  6/n  <e,  we  should  be  able  to  complete  the  proof.  Thus,  we’ll  choose 
N  =  7/e. 

Proof.  If  e  >  0,  we  choose  N  =  1  /e.  Then  for  n  >  N,  we  have 


2n  +  4 


n2  +  n  +  1 


-0 


6  n  6  6  6e 

~n2~n~N~  T<e’ 


where  the  first  inequality  follows  since  «  >  1  and,  consequently,  2/3  4-  4  <  6;;.  Thus 
((2n+4)/(n2  +  3i+  1))  converges  to  0.  □ 

Solution  (20.5).  A  sequence  (x„)  diverges  if  for  every  real  number  L  there  exists 
£  >  0  such  that  for  all  A  G  R  there  exists  n>N  with  \xn  —  L\  >  e. 

Solution  (20.6). 

Proof.  Let  L  be  a  real  number,  and  let  £  =  1  /2.  We  break  this  into  two  cases.  First 
suppose  that  L  <  0.  Let  N  £  R.  and  choose  n  to  be  an  even  integer  satisfying  n>N. 
Then  \xn  —  L\  =  1 1  —  L\  >  1  >  £.  Now  if  L  >  0,  then  for  N  £  R  choose  an  odd  integer 
with  n  >  N.  It  follows  that  \x„  —  L\  =  |  —  1  —  L\  =  |1  +L|  >  1  >  £.  Therefore  (x„) 
diverges.  □ 

Solution  (20.10).  First  note  that  limn^ooXn+i  =  lim„  .mx„.  This  is,  as  you  should 
check,  an  immediate  consequence  of  the  definition  of  the  limit  of  a  sequence.  Since 
L  =  I i m„  x„  exists.  Theorem  20.9  implies  that  lim„_.„,(x„)2  exists  and  is  Lr.  Fur¬ 
ther,  and  again  using  Theorem  20.9,  we  get 

L  =  lim  x„+i  =  lim  (0.02x,^  +  8)  =  0.02  L2  +  8. 

We  solve  this  quadratic  equation  and  get  L  =  10  or  L  =  40.  Since  (x„ )  is  bounded 
above  by  20,  we  conclude  that  \imn^ooXn  =  10. 

Using  induction,  you  can  prove  that  (x„ )  is  strictly  increasing  and  bounded  above 
by  20.  Prove  it!  Once  you’ve  done  this.  Theorem  20.11  of  Problem  20.17  below 
implies  that  (x„)  converges.  This  justifies  the  hypothesis  that  we  told  you  to  assume. 


Problems 

Problem  20.1.  For  each  of  the  following,  give  an  example  that  satisfies  all  require¬ 
ments  or  prove  that  no  such  example  exists: 

(a)  a  divergent  sequence  that  is  bounded; 

(b)  an  increasing  sequence  (xn)  with  lim^ooXn  =  9  and  lim„_>00(— 2 x„)  =  —8; 
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(c)  a  bounded  increasing  sequence  (x„)  such  that  x„  /  0  for  all  n  and  such  that 
(l/x„)  diverges. 

Problem#  20.2.  We  used  the  following  several  times  in  this  chapter:  Let  x.y.z.  £  R. 
Then  \x  —  y\  <  \x  —  z\  +  |y  —  z\.  Prove  this  statement. 

Problem  20.3.  Let  a  and  8  be  real  numbers  with  8  >  0.  Show  that  for  all  real  num¬ 
bers  x,  we  have  |x  —  a\  <  8  if  and  only  if  a  —  8  <  x  <  a  +  8 . 


Problem  20.4.  For  each  of  the  following,  guess  the  limit  and  then  prove  (using  the 
definition  of  convergence)  that  your  guess  is  correct: 


(a) 

(b) 

(c) 

(d) 

(e) 

(f) 

(g) 

(h) 


1 

lim  — ; 

«-»°°  3 n 

lim  —=; 

n—>°°  Jn 


lim 

n—>oo 

lim 

n— >oo 

lim 

lim 

ft— 

lim 

lim 


1 

Vn  +  l’ 
n2+  4 


n 

2n+l 

n  +  2  ’ 
3 


(n  +  7)!; 

3«2  + 1 
4  n2  +  n  +  2 


Problem  20.5.  Let  (x„)  and  (y„ )  be  convergent  sequences.  Use  the  definition  of 
convergence  (no  limit  theorems!)  to  prove  that  the  sequence  (3x„  —  2 yn)  converges. 

Problem  20.6.  This  is  a  continuation  of  Problem  19.5.  We  will  use  all  the  notation 
that  was  introduced  there. 

(a)  Find  an  explicit  formula  for  the  terms  Sn  of  the  sequence  (S„).  (You  may  need 
the  formula  for  a  geometric  sum,  see  Problem  19.4.) 

(b)  Guess  limn^oo  S„  and  then  prove  that  your  guess  is  correct. 

(c)  Find  lim„^oo  sn . 

(d)  Find  the  minimum  and  maximum  amount  of  phenytoin  in  the  patient’s  blood 
after  one  week  and  after  one  month  (60  administrations  of  the  drug). 

(e)  Find  the  minimum  and  maximum  amount  of  phenytoin  in  the  patient’s  blood 
in  the  long  run  (that  is,  as  n  — >  °°). 

Problem  20.7.  Prove  Theorem  20.9  part  (ii). 

Problem  20.8.  (a)  Suppose  that  (x„)  and  (y„ )  are  sequences  and  0  <  x„  <  y„  for 

all  positive  integers  n.  Show  that  if  yn  — >  0,  then  x„  —  0. 
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(b) 

(c) 


Suppose  that  (x„)  and  (y„)  are  sequences  and  —yn  <  xn  <  yn  for  all  positive 
integers  n.  Show  that  if  yn  — >  0,  then  xn  -  0. 
sin2  n  ( — ]  1 n 

Find  lim - and  lim  — — and  prove  that  your  answers  are  correct. 

n-> oo  n  nz  +  1 


Problem  20.9.  Redo  Problem  20.4,  parts  (a),  (c),  (d),  (e),  (g),  and  (h)  using  theorems 
in  this  chapter  or  Problem  20.8. 


Problem  20.10.  Let  (x„)  be  a  convergent  sequence  defined  recursively  by  xi  =  50 
and  x„+i  =  \xn  +  5  for  n  >  1.  Without  finding  an  explicit  form  of  xn,  calculate 
lim„_>ooXn.  Justify  all  steps  in  your  calculation. 

Problem  20.11.  Claim:  Let  (y„)  be  a  sequence  defined  recursively  by  y  i  =  5  and 
yn+ 1  =  3y„  6  for  n  >  1.  Then  lim,,-,,*,);,,  =  3. 

Proof?  Applying  Theorem  20.9, 


lim  yn+ 1  =  3  lim  yn  —  6. 

n — >°°  n— >oo 

Hence  21im„^00)„  =  6.  This  implies  that  lim„_ioo  yn  =  3.  0 

If  this  proof  is  correct,  fill  in  the  details,  citing  the  theorems  that  are  applied.  If 
this  proof  is  incorrect,  show  exactly  where  an  incorrect  conclusion  is  drawn.  Is  the 
claim  correct?  Explain. 

Problem  20.12.  (a)  Show  that  for  every  sequence  (x„)  we  have 

0  <  \xn\  +xn  <  2|x„|. 

(b)  Prove  that  if  |x„|  — >  0,  then  x„  — »  0.  (See  Problem  20.8.) 

(c)  If  (x„)  is  a  sequence  such  that  |x„  — >  1,  must  x„  — >  1? 

Problem  20.13.  Prove  Theorem  20.9  part  (iii).  (Hint:  You  may  want  to  use  Theo¬ 
rem  20.8.) 

Problem  20.14.  The  proof  of  Theorem  20.9  part  (iv)  is  outlined  below. 

(a)  Prove  that  for  real  numbers  y  and  M,  if  \y  —  M\  <  |M|/2,  then  |y|  >  \M\/2. 
(You  might  wish  to  use  the  lower  triangle  inequality  to  establish  this  implica¬ 
tion.) 

(b)  Let  (y„)  be  a  sequence  of  nonzero  real  numbers,  and  suppose  that  y„  — >  M, 
where  M/0.  Prove  that  if  0  <  £  <  \M\/2,  then  there  exists  N  such  that  for 
n  >  IV  if  \yn—M\  <  e,  then  \(M -yn)/(Myn)\  <  (2/M2)|M-y„|. 

(c)  Prove  Theorem  20.9  part  (iv). 

Problem  20.15.  Let  a  be  a  real  number  satisfying  0  <  a  <  1. 

(a)  Show  that  there  exists  a  real  number  x  such  that  x  >  0  and  a  -  1/(1+ x). 

(b)  Show  that  a'1  <  1/(1  +  nx)  for  all  n  G  N.  (You  will  need  to  do  Problem  18.6 
if  you  have  not  already  done  it.) 
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(c)  Show  carefully  that  1/(1  +  nx)  — >  0. 

(d)  Show  that  an  — >  0.  (You  will  want  to  do  Problem  20.8  if  you  have  not  already 
done  it.) 

(e)  Show  that  if  xn  =  0.999. ..9,  then  x„  — >  1 . 

n  9's 

Problem  20.16.  Let  xn  =  Fn+\/Fn,  where  F„  denotes  the  nth  Fibonacci  number.  In 
Problem  19.19  we  showed  that  x„  =  1  +  1  /x„_ i .  Assume  further  that  there  exists  a 
nonzero  real  number  L  such  that  xn  — >  L.  Explain  why  1  /xn  — >  1/L  and  use  these 
facts  to  compute  L. 

The  number  L  is  the  golden  ratio,  which  appears  frequently  in  architecture  and 
in  nature.  The  Greeks,  and  others,  felt  (and  still  feel)  that  rectangles  with  sides  in 
golden  ratio  are  the  most  beautiful. 

Problem  20.17.  (An  exercise  in  reading  and  writing.)  The  combination  of  the  two 
theorems  in  this  problem  is  usually  called  the  monotone  convergence  theorem  for 
sequences. 

(a)  Read  the  proof  below  until  you  understand  it.  Mathematicians  often  read 
proofs  many  times,  and  you  may  have  to  do  so  with  this  one. 

Theorem  20.11.  Every  increasing  bounded  sequence  converges  to  its  supremum. 

Proof.  Let  l  =  sup(x„),  and  let  e  >  0.  Since  /  —  e  is  not  an  upper  bound,  there  exists 
N  such  that  x^  >  l  —  e.  We  have  assumed  that  (x„)  is  an  increasing  sequence.  There¬ 
fore,  if  n  >  N,  we  know  that  x„  >  xn  >  l  —  £■  Since  x„  <  l  for  all  n,  for  n  >  N  we 
have  l  —  e<xn<I  +  £,  and  thus  \xn  —  l \  <£.  Therefore,  the  sequence  (x„)  converges 
to  /.  □ 

(b)  Use  the  ideas  in  the  proof  above  to  prove  Theorem  20.12. 

Theorem  20.12.  Every  decreasing  bounded  sequence  converges  to  its  infimum. 

(c)  Can  you  find  another  proof  of  Theorem  20.12,  this  time  using  the  statement 
of  Theorem  20. 1 1  rather  than  its  proof? 

Problem  20.18.  Use  Problems  20.15  and  20.17  to  prove  your  guess  of  Exercise  19.9, 
namely,  that  if  x„  =  0.999.  ..9,  then  sup(x„)  =  1. 

n  9's 

Problem  20.19.  This  problem  takes  up  the  situation  and  notation  of  Problem  19.5. 

(a)  Use  your  work  of  parts  (b),  (c),  and  (d)  of  Problem  19.5  and  the  theorems  of 
Problem  20.17  to  find  sup(S„)  and  sup(s„). 

(b)  Use  part  (a)  to  find  the  range  of  the  amount  of  phenytoin  in  the  patient’s  blood 
in  the  long  run  (that  is,  as  n  — >  °°).  (Note  that  though  this  is  the  same  question 
as  in  Problem  20.6  part  (e),  the  solution  does  not  require  the  explicit  form  of 
the  sequence  (S„)\) 
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Problem  20.20.  Use  the  theorems  of  Problem  20.17  for  the  following. 

(a)  Show  that  («!/«")  converges. 

(b)  Let  (x„)  be  a  bounded  sequence.  For  n  £  Z+,  let  s„  =  sup{x,„  :  m  >  n}.  Prove 
that  (s„)  converges.  The  limit  of  (s„)  is  called  the  limit  superior  of  (x„),  and 
is  usually  denoted  by  limsup(x„).  What  is  the  limsup((— 1)")? 

(c)  Let  (x„)  be  a  bounded  sequence.  For  n  £  Z+,  let  tn  =  inf{xm  :  m  >  n}.  Prove 
that  (f„)  converges.  The  limit  of  (tn)  is  called  the  limit  inferior  of  (x„),  and 
is  usually  denoted  by  liminf(x„). 

Problem  20.21.  Sequences  afford  an  excellent  opportunity  to  practice  everything 
you  have  learned.  That’s  what  you’ll  do  in  this  problem:  For  each  of  the  definitions 
below,  do  the  following. 

1.  Read  the  definition. 

2.  Try  to  find  an  example  of  something  that  illustrates  the  definition. 

3.  Try  to  find  an  example  of  something  that  does  not  satisfy  the  defining  conditions. 

4.  Write  the  definition  in  symbols. 

5.  Negate  the  definition. 

(a)  A  sequence  (x„)  of  real  numbers  diverges  to  infinity  (written  xn  — >  °°)  if  for 
every  Mel  there  exists  a  real  number  N  such  that  n>N  implies  x„  >  M. 

(b)  A  sequence  is  monotone  if  it  is  an  increasing  sequence  or  a  decreasing  se¬ 
quence. 

(c)  A  sequence  (xn)  is  a  Cauchy  sequence  if  for  all  e  >  0  there  exists  a  real 
number  A  such  that  n,m  >  N  implies  that  \xn  —xm\  <  £. 

In  addition,  for  this  one,  pretend  you  were  talking  to  a  high  school  student 
who  loves  mathematics  and  just  has  to  know  what  a  Cauchy  sequence  is  but 
has  never  heard  of  e  and  N.  What  would  you  tell  him  or  her? 

Problem  20.22.  Consider  the  following  definition. 

Let  (x„)~=1  be  a  sequence  of  real  numbers  and  let  (n^)  be  a  strictly  increasing 
sequence  of  positive  integers.  The  sequence  (xn. )  is  called  a  subsequence  of  (x„). 
For  this  problem,  do  all  the  things  you  did  in  Problem  20.21,  plus  (a),  (b),  and  (c) 
below. 

(a)  This  definition  says  that  when  you  choose  a  subsequence,  you  must  do  two 
things:  you  need  to  list  all  the  x„  in  order  of  appearance,  and  then  you  obtain 
the  x„k  by  choosing  one  element  after  the  other  from  your  list,  making  sure 
that  the  term  you  choose  comes  after  the  one  you  just  chose.  How  does  the 
definition  tell  you  that  you  must  choose  from  this  list?  How  does  it  tell  you 
that  you  must  choose  in  order  of  increasing  appearance? 

(b)  Let  xn  =  1  /n.  Ts  (x„ )  a  subsequence  of  itself?  If  yn  =  X2„ ,  give  a  formula  for 
yn  in  terms  of  n.  If  z.n  =  x«+4,  give  a  formula  for  zn  in  terms  of  n. 

(c)  How  can  you  tell  that  the  sequence  (w„)  given  by  w„  =  (— 1  )"/n  is  not  a 
subsequence  of  (x„)  as  defined  in  (b)? 

Problem  20.23.  In  Problem  20.21  part  (c)  we  defined  a  Cauchy  sequence.  Show 
that  every  convergent  sequence  is  a  Cauchy  sequence. 
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If  you  were  asked  how  many  people  are  in  your  class,  you  would  do  the  natural 
thing  and  count  them.  Thinking  about  this  carefully,  we  see  that  what  you  are  doing 
is  assigning  each  person  in  the  room  one  and  only  one  number.  If  we  asked  whether 
or  not  there  are  more  people  in  this  class  than  were  in  your  high  school  geometry 
class,  you  could  certainly  answer  that  question  by  comparing  the  two  numbers. 

Now  suppose  we  look  at  the  positive  integers  and  the  natural  numbers.  Which  set 
has  more  elements?  What  should  that  even  mean?  This  is  a  more  difficult  question  to 
answer  correctly  than  you  might  think.  The  following  mathematical  folktale  (often 
attributed  to  Hilbert)  illustrates  the  problem  and  a  solution. 

Suppose  there  is  a  hotel  with  infinitely  many  rooms.  The  hotel  is  completely 
booked  when  the  coach  of  a  Davis  Cup  team  arrives.  The  clever  manager  accom¬ 
modates  her  by  moving  all  of  the  other  guests  to  the  room  numbered  one  higher 
than  the  room  they  previously  occupied,  which  clears  the  first  room  for  the  coach. 
Then  the  four  members  of  the  team  arrive.  Each  must  have  his  own  room,  of  course. 
The  very  clever  manager  moves  everyone  up  four  rooms,  making  enough  room  for 
the  four  athletes.  Finally,  the  team’s  infinitely  many  fans  arrive  (this  happens  all  the 
time  at  really  good  hotels).  The  very,  very  clever  manager  accommodates  all  guests 
by  moving  the  residents  of  room  number  n  to  the  room  with  number  2 n.  Now  all  the 
new  people  can  go  in  the  odd-numbered  rooms. 

An  interesting  commentary  of  this  problem  was  given  by  Smilla  in  the  book 
Smilla’s  Sense  of  Snow.  She  says,  “What  delights  me  about  this  story  is  that  every¬ 
one  involved,  the  guests  and  the  owner,  accept  it  as  perfectly  natural  to  carry  out  an 
infinite  number  of  operations  so  that  one  guest  can  have  peace  and  quiet  in  a  room 
of  his  own.  That  is  a  great  tribute  to  solitude.”1  [53,  p.  11]. 

What  does  “Hotel  Infinity”  really  show  us?  It  is  our  aim  in  the  next  few  sections 
to  discuss  and  answer  these  questions. 

To  make  precise  what  it  means  for  two  sets  (even  two  infinite  sets)  to  have  the 
same  number  of  elements,  we  need  a  definition.  We  say  that  a  set  A  is  equivalent  to 

1  Excerpt  from  Smilla’s  Sense  of  Snow  by  Peter  H0eg,  translated  by  Tiina  Nunnally.  Translation 
copyright  ©1993  by  Farrar,  Straus  and  Giroux,  LLC.  Reprinted  by  permission  of  Farrar,  Straus 
and  Giroux,  FFC. 
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a  set  B  if  there  exists  a  bijection  /  :  A  — >  B.  We  write  AkB  for  A  is  equivalent  to  B. 
(Other  authors  use  the  words  equipotent  or  equinumerous.) 

You  actually  know  a  lot  about  this  concept  from  previous  chapters  (particularly 
Chapter  15).  The  next  result  summarizes  information  we  already  have. 

Theorem  21.1.  Let  stf  be  a  nonempty  collection  of  sets.  Equivalence  between  ele¬ 
ments  A  and  B  of  sd ,  as  defined  by  A  ss  B  above,  is  an  equivalence  relation  on  srt 


You  will  be  asked  to  prove  this  theorem  in  Problem  21.5. 

Example  21.2.  Show  that  the  open  interval  (0, 1)  is  equivalent  to  the  open  interval 
(0,3). 

Proof  Define  a  function  / :  (0, 1)  — >  (0, 3)  by  f(x)  =  3x.  We  leave  it  to  you  to  show 
that  this  function  is  bijective.  Thus  (0, 1)  ss  (0,3).  □ 

Since  the  relation  « is  symmetric,  we  will  usually  say  that  A  and  B  are  equivalent, 
rather  than  A  is  equivalent  to  B.  This  concept  allows  us  to  give  a  precise  definition 
of  a  finite  set.  We  say  that  a  set  S  is  finite  either  if  S  =  0  or  if  S  is  equivalent  to 
the  set  {1,2,3, . . .  ,n}  for  some  positive  integer  n.  Thus,  to  prove  that  a  nonempty 
set  is  finite,  we  need  to  find  a  bijection  between  S  and  a  set  {1,2,3, ... ,«}  for  some 
n  £  Z 1 .  Since  the  relation  is  symmetric,  either  set  can  serve  as  the  domain.  The 
bijection  is  the  mathematical  analog  of  what  we  usually  describe  as  counting.  A  set 
is  said  to  be  infinite  if  it  is  not  finite. 

Example  21.3.  By  negating  the  definition  of  finite,  say  what  it  means  for  a  set  to  be 
infinite. 

A  set  S  is  infinite  if  it  is  nonempty  and  for  every  positive  integer  n  there  does  not 
exist  a  bijection  from  S  to  the  subset  {1,2,3, ...  ,n}.  O 

What  are  some  examples  of  finite  sets?  By  our  definition,  every  set  of  the  form 
{1,2,3,. . .  ,n}  is  a  finite  set.  What  about  a  set  like  {2,4,6}?  It  certainly  feels  finite, 
but  to  prove  that  it  is  finite  we  would  have  to  construct  a  bijective  function.  It  is  easy 
here;  the  function  /  :  {1,2,3}  — >•  {2,4,6}  defined  by  /(n)  =  2 n  certainly  works.  You 
could  also  define  a  function  g  on  {1,2,3}  by  g(l)  =  4,g(2)  =  6,  and  g(3)  =  2.  In 
fact,  if  our  set  has  more  than  one  element,  then  there  is  more  than  one  choice  for  the 
bijection. 

Exercise  21.4.  Show  that  the  set  {6,8, 10, 14}  is  finite.  O 

The  rest  of  this  chapter  is  a  paraphrasing  of  much  of  the  work  we  did  in  Chap¬ 
ters  14  through  17.  We  isolate  the  important  ideas  below  and  we  guide  you  through 
the  proofs,  but  you  have  all  the  techniques  to  prove  everything  yourself. 

Theorem  21.5.  The  sets  Z  and  N  are  equivalent. 
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This  theorem  is  really  the  essence  of  the  story  behind  “Hotel  Infinity.”  An  infinite 
set,  Z,  can  have  a  proper  subset,  N,  that  has  the  “same  number  of  elements”  in  it! 

Proof.  Define  /  :  Z  — >  N  explicitly  as  follows: 

,,  ,  _  f  2x  if  x  >  0 

''  'X  {  —  (1  +  2x)  otherwise 

In  Example  15.7,  we  showed  that  this  function  is  bijective.  From  this  we  conclude 
that  Z  ss  N.  □ 


The  same  techniques  that  were  used  to  prove  Theorem  21.5  can  be  used  to  prove 
the  next  theorem. 


Theorem  21.6.  Let  A,B,C,  and  D  be  nonempty  sets.  If  A  n  =  0,  C  fl  Z)  =  0,  AkC, 
and  B  &  D,  then  A  U  B  ss  C  U  D. 


Proof.  [Outline  of  proof]  Since  A  ss  C,  there  exists  a  bijective  function  /  :  A  — >  C. 
Similarly,  since  B  «  D,  there  exists  a  bijective  function  g  :  B  — >  D.  We  define  a 
function  H  :  AUB  — >  CUD  in  cases  by 


H{x) 


f(x)  if  x  €  A 
g(x)  if  x  £  B 


We  leave  it  to  you  to  show  that  ZZ  is  well-defined  and  bijective.  □ 

Exercise  21.7.  What  happens  to  our  proof  if  A  C\B  0?  If  we  do  not  assume  that 
A  fl  B  =  0,  is  the  theorem  still  true?  Q 


For  finite  sets.  Theorem  21.6  has  an  interesting  consequence. 

Corollary  21.8.  Let  A  and  B  be  disjoint  sets.  If  A  and  B  are  finite,  thenAUB  is  finite. 


The  proof  of  Corollary  21.8  is  left  to  you  in  Problem  21.9. 

Since  the  definition  of  equivalent  sets  uses  bijective  functions,  many  of  the  theo¬ 
rems  you  have  had  will  come  in  quite  handy.  If  you  have  forgotten  the  definition  of 
the  restriction  of  a  function  (see  Definition  15.4),  you  will  want  to  review  it  before 
proceeding.  We’ll  use  restrictions  in  many  of  the  proofs  about  finite  and  infinite  sets. 

Theorem  21.9.  Let  n  be  a  positive  integer.  Then  every  subset  of  {1,2,3...../;}  is 
finite. 

Proof.  The  proof  will  be  by  induction  on  n. 

If  n  =  1,  then  our  set  is  { 1 },  and  there  are  only  two  subsets;  { 1 }  and  0.  Therefore, 
the  result  holds  if  n  =  1. 

We  now  turn  to  the  induction  step.  So  let  n  £  Z+  and  suppose  that  every  subset  of 
{1,2, . . .  ,n}  is  finite.  We  must  show  that  every  subset  of  {1,2, . . .  ,n,n+  1}  is  finite. 
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So  consider  the  set  {1,2, . .  .,n,n+  1},  and  a  subset  S  of  this  set.  If  S  C  {1,2, . . . ,«}, 
then  S  is  finite  by  our  induction  hypothesis.  Otherwise,  n  +  1  £  S.  In  this  case,  notice 
that  {n  +  1 }  is  a  finite  set.  Since  S\  {n  +  1}  C  {1 we  also  know  from  the 
induction  hypothesis  that  S\{n+  1}  is  finite.  Applying  Corollary  21.8  we  may 
conclude  that  S  =  (S  \  {n  +  1})  U  {n  +  1 }  is  finite,  completing  the  proof.  □ 

Corollary  21.10.  Let  S  be  a  finite  set.  Then  every  subset  of  S  is  finite. 

Proof.  If  S  is  empty,  the  result  is  clear.  So  suppose  that  S  is  nonempty,  and  let  T 
denote  a  subset  of  S.  Again,  we  may  assume  that  T  is  nonempty. 

By  our  assumption  there  exists  n  £  Z+  and  a  bijection  /  :  S  — ►  {1,2, . . .  ,n}.  By 
Problem  15.19,  the  restriction  function,  /|y,  is  a  bijective  mapping  from  T  onto 
a  nonempty  subset  B  of  {1,2, .. .  ,n}.  From  Theorem  21.9,  the  set  B  is  finite,  and 
therefore  there  exists  a  positive  integer  m  and  a  bijection  g  :  B  — »  {1,2, .. . .  m } .  By 
Theorem  16.6,  the  composition  h  =  go  ( f\T )  of  the  two  bijective  functions  g  and 
f\ t  is  a  bijection  of  T  onto  {1 , 2, . . . , m}.  Thus  T  is  finite,  completing  the  proof.  □ 

We  assumed  in  Corollary  21.8  that  the  finite  sets  A  and  B  were  disjoint,  but 
our  intuition  tells  us  that  the  union  of  two  finite  sets  should  be  finite.  How  do  we 
prove  this?  The  idea  is  that  the  union  of  two  sets  can  be  expressed  as  the  union  of 
two  disjoint  sets:  the  intersection  appears  in  the  union  twice,  so  to  speak,  so  if  we 
remove  it  once  (from  one  of  the  sets),  we  haven’t  changed  the  union.  That  is  the  key 
to  the  next  result. 

Theorem  21.11.  The  union  of  two  finite  sets  is  finite. 

Proof.  Let  A  and  B  denote  the  two  finite  sets.  Now  you  have  already  shown  (in 
Problem  7.14)  that 

A  UB  =  (A\B)\JB, 

and  it  should  be  clear  that  these  two  sets  are  disjoint.  By  Corollary  21.10,  the  set 
A  \B  is  finite.  The  set  B  is  finite  by  assumption.  Since  these  are  two  disjoint  sets,  we 
have  written  A  UB  as  the  disjoint  union  of  two  finite  sets  and  Corollary  21.8  now 
implies  that  A  U  B  is  finite.  □ 

Note  that  in  the  theorem  above,  we  showed  that  a  set  is  finite  without  exhibiting  a 
specific  bijection.  We’ll  be  building  up  many  results  that  are  useful,  and  we  will  not 
always  go  back  to  the  definition  to  see  how  to  prove  things.  This  is  a  great  plus — 
proofs  become  shorter,  and  sometimes  prettier  and  more  interesting.  Of  course,  to 
use  the  theorems,  you  also  have  to  know  what  they  are! 

We  conclude  this  chapter  with  a  useful  exercise. 

Exercise  21.12.  Use  induction  to  prove  the  following.  Let  m  £  Z+.  If  ,4 1  ./L, . . .  ,Am 
are  finite  sets,  then  the  union  {JT=  i  A/  is  finite. 
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Definitions 

Definition  21.1.  A  set  A  is  equivalent  to  a  set  B  if  there  exists  a  bijection  /  :  A  — >  B. 
We  write  A  «  B  for  A  is  equivalent  to  B. 

Definition  21.2.  A  set  S  is  finite  if  either  S  =  0  or  if  S  is  equivalent  to  the  set 
{1,2,3,...,  n}  for  some  positive  integer  n . 

Definition  21.3.  A  set  is  infinite  if  it  is  not  finite. 


Solutions  to  Exercises 

Solution  (21.4).  We  define  /:  {6,8, 10, 14}  ->  {1,2, 3,4}  by  /( 6)  =  1,  /( 8)  =  2, 
/(10)  =  3,  and  /( 14)  =4.  It  is  clear  that /is  abijective  function,  hence  {6,8, 10, 14} 
is  finite. 

Solution  (21.7).  If  A  and  B  are  not  disjoint,  the  function  H  may  not  be  well-defined. 
In  addition,  if  A  and  B  are  not  disjoint,  the  conclusion  of  the  theorem  may  not  hold. 
To  see  this,  take  finite  sets  with  A  =  B  =  {1},  C  =  {2},  and  D  =  {3}. 

Solution  (21.12).  We  will  prove  this  statement  by  induction  on  m.  For  the  base  step 
( m  =  1),  we  get  (J  /=i  A  y  =  A  i,  which  is  finite  by  assumption. 

Now  we  proceed  to  the  induction  step.  Let  m  G  Z+  and  suppose  that  (J}"  i  A  j  is 
finite.  We  need  to  show  that  UT/|'  4/  is  finite.  But  Aj  =  (U”LiA;)  UAm+i. 
By  the  induction  hypothesis,  Uy=i  Aj  is  finite,  and  Am+\  is  assumed  to  be  finite.  By 
Theorem  21.11,  the  union  of  two  finite  sets  is  finite.  Thus  (J™  V  4  ,  is  finite.  The 
result  now  follows  from  the  principle  of  mathematical  induction. 


Problems 

Problem  21.1.  Show  that  the  following  intervals  of  real  numbers  are  equivalent: 

(a)  [0,1]  and  [0,2]; 

(b)  [0,1]  and  [2,5], 

Problem  21.2.  Prove  that  { 1 , . . . ,  10}  x  { 1 , . . . ,  15}  is  finite  using  only  the  definition 
of  a  finite  set;  that  is,  write  down  the  relevant  bijection  explicitly. 

Problem  21.3.  Explain,  in  words,  the  difference  between  a  “finite  union  of  sets”  and 
a  “union  of  finite  sets.”  Give  examples  of  each  that  show  these  really  are  different. 

Problem*  21.4.  (a)  Show  that  the  set  Q+  of  positive  rationals  and  the  set  Q  of 

negative  rationals  are  equivalent. 
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(b)  Show  that  the  set  of  even  integers  and  set  of  odd  integers  are  equivalent. 
Problem  21.5.  Prove  Theorem  21.1. 

Problem  21.6.  (a)  Prove  that  (0,1)  «  EL  (If  you  choose  to  use  Problem  15.13, 

make  sure  you  solve  that  problem  too!) 

(b)  Prove  that  R  ss  R+. 

Problem  21.7.  (a)  Show  that  Z  «  2Z. 

(b)  Using  theorems  from  this  chapter  (don’t  define  functions!)  show  that  2Z  ss  N. 

Problem  21.8.  Prove  Theorem  21.6  working  with  the  outline  given  in  the  text. 

Problem  21.9.  (a)  Suppose  that  A  and  B  are  nonempty  finite  sets  and  A  flfi  =  0. 

Show  that  there  exist  positive  integers  n  and  m  such  that  ,4  «  {1.2,..../;}  and 
B  k,  {n  +  1, . . .  ,n  +  m}. 

(b)  Prove  Corollary  21.8. 

Problem  21.10.  Prove  Theorem  21.13  below.  We  suggest  that  you  start  by  working 
Problem  16.17  if  you  have  not  already  done  so. 

Theorem  21.13.  Let  A,  B,C,  and  D  be  nonempty  sets  with  A  «  C  and  B  «  D.  Then 
A  x  B  «  C  x  D. 

Problem  21.11.  Prove  the  following  corollary  of  Theorem  21.13  above. 

Corollary  21.14.  Let  A  and  B  be  finite  sets.  Then  Ax  B  is  a  finite  set. 

Problem  21.12.  Let  g/  =  {[a,b)  :  a,b  and  a  <  b}  be  the  collection  of  bounded 
half-open  intervals  of  real  numbers. 

(a)  Prove  that  R+. 

(b)  Prove  that  si  ss  R  x  R. 

Problem  21.13.  For  j  £  Z+,  let  Aj  C  {1 , . . . ,  j}.  Suppose  that  for  some  n  £  Z+,  we 
have  B  C  |F=i  Aj.  Is  B  necessarily  finite?  Prove  it  or  give  a  counterexample. 

Problem  21.14.  Let  A  be  a  subset  of  the  set  of  all  functions  /  :  R  — >  R  and  let 
g  :  R  — >  R  be  a  bijective  function.  We  use  the  notation  gAg^1  =  {gofog^1  :  f  £  A}. 
Prove  that  A  «  gAg~ 1 . 

Problem  21.15.  Prove  that  ((Z+  x  N)  U  (Z_  x  N))  wZxZ. 

Problem  21.16.  By  Theorem  2 1 .5,  Z  ss  N  and  hence  there  is  a  bijection  g :  Z  — >  N. 

(a)  We  define  G:ZxZ— *  N  x  N  by  G(n,m)  =  ( g(n),g(m )).  Prove  that  G  is  a 
bijection  from  Z  x  Z  to  N  x  N. 

(b)  Explain  why  part  (a)  shows  that  Z  x  Z  ss  N  x  N. 

(c)  Show  that  ZxZ«NxN  using  theorems  from  this  chapter  and  without  con¬ 
structing  a  bijective  function. 
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Problem  21.17.  Let  A  be  a  nonempty  finite  set  of  integers  and  let  Jzf  be  the  set  of  all 
linear  polynomials  with  coefficients  in  A;  that  is,  Jz?  =  { ax  +  b  :  a  £  A  and  b  £  A}. 
Prove  that  Jzf  is  finite. 


Problem  21.18.  Let  A  =  {{a\,a2,a^1a^,a^)  :  aj  £  {0,1}  for  j  £  Z  and  1  <  7  <  5}. 
Prove  that  ^({1,2, 3, 4, 5})  «  A. 

Problem  21.19.  Let  n  £  Z+  and  define  A„  =  {(«!,. . .  ,a„)  :  aj  £  {0, 1}  for  j  £  Z 
and  1  <j<  n}.  Prove  that  for  all  n  £  Z+  we  have  £P({1 ss  A„.  (See  Prob¬ 
lem  21.18.) 

Problem  21.20.  Prove  that  (0,°°)  «  [0,°°). 

Problem  21.21.  Prove  that  (0,1)  «  [0, 1]. 


Chapter  22 

Finite  Sets  and  an  Infinite  Set 


We  have  proved  that  sets  are  finite,  but  we  have  not  yet  shown  rigorously  that  a  set 
is  infinite.  It  is  not  as  easy  as  you  might  think  to  do  so,  nor  do  we  have  an  exact 
notion  of  what  it  means  for  a  finite  set  “to  have  n  elements.”  Our  proof  of  the  former 
and  the  definition  of  the  latter  will  depend  on  a  principle  known  as  the  pigeonhole 
principle.  The  pigeonhole  principle  is  something  that  is  familiar  to  all  of  us.  As  a 
very  simple  example  of  this,  recall  a  childhood  birthday  party  in  which  you  played 
musical  chairs.  In  case  you  weren’t  invited  to  any  parties,  we’ll  remind  you  of  the 
rules  behind  the  game.  Let’s  say  there  were  10  children  at  the  party.  Someone,  say 
the  child’s  father,  would  set  up  9  chairs  in  a  row.  Someone  else,  say  the  child’s 
mother,  would  play  a  song  on  the  piano  stopping  unexpectedly  at  some  point.  When 
the  music  ended,  the  10  children  would  scramble  for  the  9  seats.  If  everyone  sat 
down,  two  people  would  sit  in  the  same  chair.  This  game  is  our  first  example  of  the 
pigeonhole  principle.  Now  we  turn  to  a  more  elaborate  one. 

Theorem  22.1.  Suppose  that  n  people  (n  >  2)  are  at  a  party.  Then  there  exist  at 
least  two  people  at  the  party  who  know  the  same  number  of  people  present. 

First  you  need  to  know  the  rules.  We  will  assume  that  no  one  knows  him-  or 
herself.  We  will  also  assume  that  if  x  claims  to  know  y,  then  y  also  knows  x. 

The  idea  behind  the  proof  is  this,  and  you  can  try  it  out  at  your  next  party.  You 
will  put  n  boxes  on  the  board  numbered  0  through  n  —  1 .  Each  person  counts  up 
the  number  of  people  he  or  she  knows  at  the  party.  You  ask  them  that  number  and 
write  their  name  in  the  box  with  the  same  number.  Note  that  each  person’s  answer 
corresponds  to  exactly  one  of  the  boxes  0  through  n  —  1 .  The  theorem  claims  that  at 
least  two  people’s  names  will  end  up  in  the  same  box. 

Proof.  We  imagine  n  boxes  that  are  numbered  0  through  n  —  1 .  For  an  integer  m 
with  0  <  m  <  n  —  1,  box  m  contains  the  names  of  the  people  who  know  m  people  at 
the  party. 

We  break  this  proof  into  two  cases.  First,  suppose  that  there  is  someone  at  the 
party  who  doesn’t  know  anyone.  We’ll  call  this  party  crasher  Ms.  X.  Now  if  we  pick 
some  other  party  attendee,  he  doesn’t  know  himself  and,  since  Ms.  X  doesn’t  know 
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him,  he  doesn’t  know  Ms.  X  either.  This  implies  that  he  knows  at  most  n  —  2  people 
at  the  party.  The  point  is  this:  No  one’s  name  can  be  in  the  box  labeled  n  —  1.  This 
means  that  the  names  of  the  n  people  are  in  the  n  —  1  boxes  labeled  0  through  n  —  2. 
Obviously  then,  there  is  a  box  with  at  least  two  names  in  it,  indicating  that  two  of 
those  people  know  the  same  number  of  people  and  we  are  done  in  this  case. 

Now  suppose  that  everyone  knows  at  least  one  person  at  the  party.  Then  no  one’s 
name  can  be  in  the  box  marked  0;  everyone’s  name  will  be  in  one  of  the  n—  1  boxes 
marked  1, . . .  ,n  —  1.  Once  again  we  have  n  names  in  n  —  1  boxes,  and  thus  at  least 
two  must  be  in  the  same  box.  □ 

This  theorem  and  its  proof  illustrate  the  idea  behind  the  pigeonhole  principle.  In 
its  popular  form,  the  principle  says  that  if  there  are  more  pigeons  than  holes,  then 
at  least  one  hole  is  the  home  of  more  than  one  pigeon.  There  are  many  wonderful 
applications  of  the  pigeonhole  principle  (and  many  can  be  found  at  the  website  [13] 
under  algebra). 

We  now  turn  to  the  more  precise  statement  of  the  pigeonhole  principle  and  its 
proof.  The  principle  is  attributed  to  Peter  Gustav  Lejeune  Dirichlet  and  is  also 
known  as  the  Dirichlet  principle  or  the  Dirichlet  drawer  principle.  (There  are  other 
Dirichlet  principles;  see  Spotlight:  Minimum  or  Infimum?  in  Chapter  17.)  Curiously, 
this  intuitively  obvious  principle  has  a  rather  intricate  proof. 

Theorem  22.2  (Pigeonhole  principle).  Let  m  and  n  be  positive  integers  with  m  >  n, 
and  let  f  be  a  map  satisfying  f  :  { 1 , . . . ,  m}  — >  { 1 , . . . ,  n}.  Then  f  is  not  one-to-one. 

Proof.  We  will  prove  this  theorem  by  induction  on  n.  The  assertion  that  we  will 
prove  is:  For  every  integer  m  such  that  m  >  n,  if  f m}  ->  {l,...,n},  then  / 
is  not  one-to-one. 

For  the  base  case,  n  =  1 .  Our  assumption  that  m  >  n  implies  that  m  >  1 .  For  every 
map  /  :  {1, . . .  ,m}  — »  {1},  it  is  clear  that  /( 1)  =  1  =  f(m).  Since  1  ^  m,  we  may 
conclude  that  /  is  not  one-to-one,  completing  the  base  step. 

For  the  induction  step,  let  n  £  Z+  and  suppose  that  if  m  is  an  integer  greater  than 
n  and  /  is  a  function  /  :  { 1 , . . . , m}  — ►  {1, . . .  ,n},  then  /  is  not  one-to-one.  We  will 
show  that  if  m  >  n+ 1  and  / :  {1, . . .  ,m}  — ►{l,...,w+l},  then  /  is  not  one-to-one. 

So  let  us  suppose  that  m  >  n  +  1  and  we  have  a  map 

/:{!)•••,«»}->  {l,...,n+l}. 

There  are  three  cases  to  consider:  Either  /  maps  nothing  to  n  +  1,  more  than  one 
element  to  n  +  1 ,  or  exactly  one  element  to  n  +  1 . 

For  the  first  case,  n  +  1  is  not  in  the  range  of  /,  so  /  actually  defines  a  map 
/  :  {1, . . .  ,m}  — ►  {1, . . .  ,n}.  Since  m>  n  +  1  >  n,  our  induction  hypothesis  tells  us 
that  /  is  not  one-to-one,  and  we  are  done  in  this  case. 

In  case  two,  there  exist  j,k  £  {1, . . .  ,m}  with  j  ^  k,  and  f(  j)  =  n+  1  =  f(k). 
Then  /  is  not  one-to-one,  and  we  are  done  in  this  case,  too. 

For  the  last  case,  we  may  assume  that  j  £  { 1 , . . .  ,ra}  is  the  only  integer  for  which 
f(  j)  =  n  +  1 .  We  now  define  the  function  g  :  { 1 , . . . ,  m}  — >  { 1 , . . . ,  m)  that  inter¬ 
changes  m  with  j  and  leaves  all  other  elements  of  { 1 , . . . , m)  fixed: 
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k  if  k  ^  j,m 
j  if  k  =  m 
m  if  k  =  j 
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g(k)  = 


Then  g  is  one-to-one.  Furthermore,  since  j  is  the  only  integer  that  /  maps  to  n  +  1 
we  know  that  ( f°g)(k )  =  f(g(k))  =  n+  1  if  and  only  if  g(k)  =  j.  Since  g  is  one-to- 
one,  this  happens  if  and  only  if  k  =  m.  Thus  (/°g)|{i,...jm-i}  maps  {1, ...  ,m—  1}  to 
{1, . . .  ,n}.  Since  we  assume  that  m  >  n+  1,  we  know  that  m  —  1  >  n.  Our  induction 
hypothesis  now  applies,  and  we  conclude  that  (/°g)|{i,...,m-i}  is  not  one-to-one. 
By  Problem  15.19  fog  is  not  one-to-one.  Since  g  is  one-to-one,  by  Theorem  16.7 
/  is  not  one-to-one  in  this  case  either. 

By  the  principle  of  mathematical  induction,  if  m,n  €  Z+  and  m  >  n,  then  no 
function  /  :  { 1 , . . . , m}  —>  {1 is  one-to-one.  □ 

The  proof  of  the  pigeonhole  principle  summarizes  much  of  what  you  learned: 
mathematical  induction,  proof  in  cases,  and  one-to-one  functions. 

We  are  now  in  a  position  to  prove  that  a  set  is  infinite. 

Theorem  22.3.  The  set  N  is  infinite. 

Proof.  Suppose  to  the  contrary  that  N  is  finite.  Since  N^0  there  exists  an  integer 
m  and  a  one-to-one  mapping,  g,  of  N  onto  {1,2,...,  m}.  Now  {l,2,...,m+l}CN, 
so  we  may  consider  the  restriction  g|{i,2,...,m+i}  :  {1,2, . . .  ,m+  1}  — >  {1,2, .. .  ,m}. 
The  pigeonhole  principle  (Theorem  22.2)  implies  that  g |{ij2,...,m+i}  is  not  one-to- 
one.  This,  in  turn,  implies  (as  you  surely  showed  in  Problem  15.19)  that  g  is  not 
one-to-one,  contradicting  our  choice  of  g.  Therefore,  it  must  be  the  case  that  N  is 
infinite.  □ 

Exercise  22.4.  Prove  that  Z  is  infinite.  Q 


The  next  exercise  is  similar  to  the  one  above,  but  requires  more  work. 

Exercise  22.5.  Prove  that  if  X  is  an  infinite  set,  then  the  power  set  of  X  is  infinite. 

o 

We  carefully  defined  what  it  means  for  a  set  to  be  finite,  but  so  far  we  have  not 
described  what  it  means  for  a  set  to  “have  n  elements.”  After  the  following  theorem, 
we  will  be  ready  to  do  that. 

Theorem  22.6.  Let  A  be  a  nonempty  finite  set.  There  is  a  unique  positive  integer  n 
such  that  A  «  { 1 

Before  we  begin,  note  that  there  are  two  things  to  show:  there  exists  a  positive 
integer  n  with  certain  properties  and  that  there  is  only  one  such  integer.  One  of  these 
should  be  easy.  Which  one? 
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Proof.  The  existence  of  some  n  £  Z+  such  that  there  is  a  bijection  / :  A  — >  {1 , . . .  ,n} 
is  guaranteed  by  the  definition  of  a  nonempty  finite  set.  So  all  we  have  to  do  is  show 
that  there  is  no  other  m  £  Z+  with  an  associated  bijection  g  :  A  — >  {1 , . . . ,  m}. 

Suppose  to  the  contrary  that  there  does  exist  such  a  positive  integer  m  with 
m  /  n,  and  bijective  function  g.  Since  m  f  n,  one  of  these  integers  must  be  larger 
than  the  other,  so  we  assume  that  m  >  n.  Since  g  is  a  bijection,  it  has  an  inverse. 
Composing  the  two  bijective  functions  /  and  g  1 ,  we  obtain  a  bijective  function 
fog -1  :  {1, . . .  ,m}  — >  {1, . . .  ,n}.  But  this  contradicts  the  pigeonhole  principle,  and 
we  conclude  that  n  is  unique.  □ 

The  integer  n  in  the  above  theorem  is  exactly  what  we  mean  by  the  “size  of  A”  or 
“number  of  elements  in  A.”  We  will  say  that  the  cardinality  of  a  finite  set  A  is  0  if  A 
is  empty  and  n  if  A  sa  {1, . . .  ,n}.  In  symbols,  we  write  \A  \  =  n,  where  n  £  N.  (Why 
couldn't  we  define  the  cardinality  of  a  set  before  we  proved  the  theorem?)  While  it 
is  possible  to  define  cardinality  in  more  generality,  we  have  only  defined  it  in  the 
case  that  A  is  finite. 


Definition 

Definition  22.1.  The  cardinality  of  a  finite  set  A  is  0  if  A  is  empty  and  n  if 
A  «  {1 In  symbols,  we  write  |A|  =  «,  where  n  £  N. 


Solutions  to  Exercises 

Solution  (22.4). 

Proof.  We  know  that  NcZ,  and  Theorem  22.3  tells  us  that  N  is  infinite.  Since 
Corollary  21.10  says  that  every  subset  of  a  finite  set  is  finite,  our  set  Z  must  be 
infinite.  □ 

Solution  (22.5). 

Proof.  Define  a  map  /  :  X  — >  ^(X)  by  f(x)  =  {jr}  for  all  x  £  X .  It  is  clear  that  / 
is  well-defined  and  one-to-one.  Therefore,  /  maps  X  onto  a  subset,  ,&/,  of  2?(X). 
Since  X  is  infinite  and  /  is  a  bijection  of  X  onto  ■<//,  we  know  that  ///  must  be  infinite 
as  well.  Thus,  sd  is  an  infinite  subset  of  ^a( X ),  and  we  may  use  Corollary  21.10  to 
conclude  that  ^(X)  is  infinite.  □ 


Problems 

Problem  22.1.  Consider  the  story  of  n  people  at  a  party  in  Theorem  22.1.  Suppose 
someone  else  has  a  rival  party  the  same  evening,  and  no  one  can  attend  both.  Some- 
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one  takes  a  picture  of  the  people  at  the  rival  party  and  shows  it  to  everyone  at  your 
party.  Your  party  isn’t  that  much  fun,  so  you  each  look  at  the  picture  and  say  how 
many  people  you  know  at  the  other  party.  No  one  says  the  same  number.  What  can 
you  conclude  about  the  number  of  people  attending  the  other  party? 

Problem  22.2.  (a)  Suppose  there  are  15  people  in  a  class.  Show  that  two  people 

must  be  born  in  the  same  month. 

(b)  A  conductor  has  just  taken  on  a  new  job  in  a  small  town  where  he  has  five 
trumpet  players  in  his  orchestra.  He  has  a  concert  every  other  evening  for 
his  first  year.  Traditionally  the  players  are  seated  from  left  to  right  in  order  of 
decreasing  musical  ability.  The  conductor  does  not  want  to  offend  the  players, 
so  he  has  decided  to  seat  them  differently  at  each  performance.  Can  he  do  it? 
Why  or  why  not? 

Problem  22.3.  Suppose  21  numbers  are  chosen  from  the  set  {1, . . .  ,40}.  Show  that 
among  the  chosen  numbers  there  are  (at  least)  two  of  them,  n  and  m,  such  that 
n  —  m= 1 . 

Problem  22.4.  Let  S  be  a  region  in  the  plane  bounded  by  a  square  with  sides  of 
length  two.  Prove  that  if  we  put  five  points  in  S,  there  exist  (at  least)  two  of  these 
points  that  are  at  most  a  distance  of  \/2  apart. 

Problem  22.5.  Let  and  let  /  :  {1 , 2, . . . ,  2n  +  1}  — >  {1 , 2, . . . ,  2n  +  1 }  be  a 

bijective  function.  Prove  that  for  some  odd  integer  k£  { 1 , 2 , . . . ,  2n  +  1} ,  the  integer 
f(k)  is  also  odd. 

Problem  22.6.  Let  /  :  { 1 , 2, . . . ,  99}  — >  { 1 , 2, . . . ,  99}  be  an  injective  function.  De¬ 
fine  g  :  {1,2, . . .  ,99}  — >  N  by  g(n)  =  «  +  /(«).  Prove  that  there  exists  an  integer  n 
such  that  g(n)  is  even. 

Problem  22.7.  Prove  the  following  alternate  form  of  the  pigeonhole  principle. 

Let  A  and  B  be  nonempty  finite  sets,  and  suppose  that  |A|  >  \B\.  If  /  :  A  — >  B  is  a 
function,  then  /  is  not  one-to-one. 

Problem*  22.8.  Show  that  Q  is  infinite. 

Problem  22.9.  Using  only  the  definition  of  finite  and  the  pigeonhole  principle, 
prove  that  R  is  infinite. 

Problem  22.10.  Let  A  be  a  set,  and  suppose  that  B  is  an  infinite  subset  of  A.  Show 
that  A  must  be  infinite. 

Problem  22.11.  Suppose  that  A  is  an  infinite  set,  B  is  a  finite  set,  and  f  :A—>  B  is  a 
function.  Show  that  there  exists  b  €  B  such  that  /-1  ({&})  is  infinite. 

Problem  22.12.  Let  X  be  an  infinite  set,  and  A  and  B  be  finite  subsets  of  X .  Answer 
each  of  the  following,  giving  reasons  for  your  answers: 


(a)  Is  A  HB  finite  or  infinite? 
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(b)  Is  A  \  B  finite  or  infinite? 

(c)  Is  X  \  A  finite  or  infinite? 

(d)  Is  A  U  B  finite  or  infinite? 

(e)  If  /  :  A  — >  X  is  a  one-to-one  function,  is  /(A)  finite  or  infinite? 

Problem  22.13.  Let  A,B,  and  Cbe  finite  sets. 

(a)  Recall  that  we  showed  that  if  A  and  B  are  disjoint,  then  A  U  B  is  finite.  Look 
over  the  proof  outlined  in  Problem  21.9  and  determine  |A  U  Z?|  in  terms  of  |A| 
and  \B\,  assuming  that  A  and  B  are  disjoint. 

(b)  Suppose  that  A  and  B  are  not  disjoint.  Show  that  |A  UB|  =  |A |  +  |Z?|  —  |A  nfi|. 

(c)  Find  a  formula  that  works  for  three  sets  A .  /I,  and  C.  (You  don’t  need  to  prove 
that  your  formula  works.) 

Problem  22.14.  (a)  Suppose  that  A  and  B  are  finite  sets  with  |A|  =  m  and  \B\  =  n. 

In  Problem  21.11  you  showed  that  if  A  and  B  are  finite,  then  Ax  B  is  finite. 
Look  over  the  proof  and  determine  |A  x  B\  in  terms  of  m  and  n. 

(b)  Suppose  that  Ai  ,A2, . . .  ,/L  are  finite  sets.  Guess  a  formula  for  the  cardinality 
of  Ai  x  A2  x  •  •  •  x  A*,  (in  terms  of  |Aj  |,  | A2 1 ,  •  • .,  and  |A*|).  Prove  that  your 
formula  is  correct. 

Problem  22.15.  Prove  that  if  X  is  a  finite  set  with  \X\  =  n  G  N,  then  \£?(X)  \  =2". 

Problem  22.16.  Each  of  the  problems  below  is  an  application  of  one  of  the  counting 
principles  given  in  Problems  22.13  and  22.14.  Decide  which  part  of  that  problem 
applies,  and  use  it  to  answer  the  problem. 

(a)  Thirty  second-graders,  twenty-five  third-graders,  and  fifteen  fourth-graders 
entered  an  art  contest.  Three  prizes  were  awarded,  one  for  each  grade.  In  how 
many  ways  can  the  prizes  be  awarded  to  three  of  the  children?  (Don’t  forget 
to  say  which  formula  from  Problem  22. 13  or  22. 14  applies.) 

(b)  Suppose  that  there  are  100  people  in  a  room.  Of  these  55  are  men,  33  are 
Swiss,  10  are  Swiss  males.  How  many  are  Swiss  or  male  (or  both)?  (Don’t 
forget  to  say  which  formula  from  Problem  22.13  or  22.14  applies.) 

The  rest  of  the  problems  are  interrelated.  If  you  can’t  see  how  to  do  the  prob¬ 
lem  you  are  working  on,  look  at  the  results  from  the  previous  problems  and  Prob¬ 
lem  22.13. 

Problem  22.17.  Let  A  be  a  nonempty  finite  set  with  |A|  =  n  and  let  a  G  A.  Prove  that 
A  \  {a}  is  finite  and  |A  \  {«} |  =  n  —  1. 

Problem  22.18.  (a)  Suppose  that  A  is  a  finite  set  and  B  C  A.  We  showed  that  B  is 

finite.  Show  that  |B|  <  |A|. 

(b)  Suppose  that  A  is  a  finite  set  and  B  C  A.  Show  that  if  li  f  A,  then  \B\  <  |A|. 

(c)  Show  that  if  two  finite  sets  A  and  B  satisfy  B  C  A  and  |A|  <  |B|,  then  A=B. 

Problem  22.19.  Suppose  that  A  and  B  are  finite  sets  and  /  :  A  — >  B  is  one-to-one. 
Show  that  |A|  <  |Z?|. 
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Problem  22.20.  Let  A  and  B  be  sets  with  A  finite.  Let  /  :  A  — »  B.  Prove  that 
|ran(/)|  <  |A|. 

Problem  22.21.  Let  A  be  a  finite  set.  Show  that  a  function  /  :  A  — >  A  is  one-to-one 
if  and  only  if  it  is  onto.  Is  this  still  true  if  A  is  infinite? 


Chapter  23 

Countable  and  Uncountable  Sets 


I  see  it  but  I  do  not  believe  it. — Georg  Cantor  [59,  p.  997] 

Having  mastered  finite  sets,  we  now  turn  to  understanding  the  infinite.  We  know 
that  N  is  infinite,  and  we  know  that  Q  is  infinite  (see  Problem  22.8).  Are  they  equiv¬ 
alent?  In  some  sense,  we  can  count  N  and  it  may  feel  as  though  we  cannot  count 
Q — that  is,  as  though  we  cannot  list  a  first  element,  second  element,  third  element, 
and  so  on.  However,  we  shall  see  that  Q  and  N  are,  in  fact,  equivalent. 

An  infinite  set  A  is  said  to  be  countably  infinite  if  A  «  N.  In  Chapter  21  we 
showed  that  Z  ss  N  and  2Z  «  N,  so  these  also  are  countably  infinite.  It  is  also  easy 
to  show  that  Z+  =  N  \  {0}  «  N.  A  set  is  countable  if  it  is  either  finite  or  countably 
infinite.  A  set  is  said  to  be  uncountable  if  it  is  not  countable.  Note  that  if  we  only 
know  that  a  set  is  countable,  we  don’t  necessarily  know  if  it  is  finite  or  infinite.  If 
we  have  an  infinite  countable  set,  it  automatically  is  equivalent  to  N. 

Exercise  23.1.  Let  A  and  B  be  two  countably  infinite  sets.  Prove  that  there  is  a  In¬ 
jection  of  A  onto  B. 

Theorem  23.2.  Every  subset  ofN  is  countable. 

The  proof  of  this  theorem  will  be  presented  as  an  exercise  in  reading  mathemat¬ 
ics. 

Exercise  23.3.  This  is  an  exercise  in  reading  a  proof.  We  ask  that  as  you  read  you 
pretend  that  the  set  T  (appearing  in  the  proof)  is  the  set  of  prime  numbers.  Of  course, 
you  are  not  allowed  to  pick  a  particular  subset,  call  it  T,  and  conclude  that  you 
have  proved  the  theorem.  However,  for  the  purpose  of  understanding  someone  else’s 
proof,  this  might  be  quite  helpful.  We  will  call  this  set  the  demo,  and  denote  it  by 
T .  Whenever  you  see  (?)  you  should  figure  out  what  happens  for  this  set.  If  at  the 
end  you  remain  largely  unsatisfied,  pick  another  set  for  T  and  try  again.  No  matter 
what,  you’ll  understand  more  of  the  proof  than  if  you  hadn’t  tried  anything  at  all.  So 
read  the  proof,  think  about  the  question  marks,  and  then  answer  the  set  of  questions 
provided  below. 
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(a)  In  the  statement  of  Theorem  18.6  a  function  /  appears.  What  is  this  function 
in  the  proof  (when  the  demo  set  T  is  used)? 

(b)  Find  g(0),g(l), . . .  ,g( 5)  in  the  demo.  What  is  the  subset  of  T  that  is  used  to 
define  g( 6)  in  our  demo?  What  is  g(6)  (when  you  use  the  demo  set  T  to  define 
g( 6))? 

(c)  For  one-to-one,  we  thought  we  had  to  show  that  if  g(j)  =  g(£),  then  j  =  i. 
What’s  going  on  in  this  proof? 

(d)  Say  t  =  19.  What  element  is  mapped  to  t  when  the  demo  set  is  used?  Q 


Before  proceeding  with  our  proof,  we  give  a  sense  of  the  idea  behind  it:  If  T  is  an 
infinite  subset  of  N,  we  will  construct  a  function  g  :  N  — >  T  recursively,  listing  the 
elements  of  T  in  increasing  order.  Since  the  list  is  strictly  increasing,  g  will  be  one- 
to-one.  And,  since  the  list  is  constructed  to  look  at  what  has  been  chosen  and  then 
move  on  to  the  next  largest  number  in  T,  the  function  g  will  also  be  onto.  This  part 
of  the  proof  will  require  some  work,  however,  and  we  will  proceed  as  follows:  We 
will  suppose  that  g  is  not  onto.  The  well-ordering  principle  will  allow  us  to  conclude 
that  there  is  a  smallest  element  in  T  that  is  not  in  the  range  of  g.  However,  g  will  be 
constructed  in  a  way  that  will  not  allow  us  to  “skip  over”  elements  of  T  and  so  we 
will  obtain  a  contradiction.  We  now  make  these  ideas  precise. 

Note  that  the  (?)’s  in  this  proof  refer  to  Exercise  23.3.  Once  you  have  worked 
through  the  exercise,  you  should  be  able  to  read  through  the  proof,  ignoring  the 
symbol  (?)  as  you  read. 

Proof.  Let  T  denote  a  subset  of  N.  If  7  is  finite,  it  is  countable  and  we  are  done.  (?) 
We  suppose,  then,  that  T  is  infinite  and  show  that  it  is  countably  infinite.  To  this  end 
we  construct  a  bijection  g  :  N  — >  T  recursively. 

Since  T  is  a  nonempty  subset  of  N,  the  well-ordering  principle  implies  that  T 
has  a  least  element.  We  set  g(0)  =  min  T .  (?)  Note  that  for  any  t  £  T  we  can  write 
T  =  {x  £  T  :  x  <  t }  U  {x  £  T  :  x  >  t }.  (?)  Since  {x  £  T  :  x  <  t}  C  {0, . . .  ,t },  we  may 
use  Theorem  21.9  to  conclude  that  {x£T  :  x  <  f}  is  finite.  Since  T  is  infinite.  The¬ 
orem  21.11  implies  that  {x  £  T  :  x  >  t}  is  infinite  for  every  t  £  T.  (?)  In  particular, 
the  set  {x  £  T  :  x  >  t}  is  nonempty  for  every  t  £  T  and  contains  a  minimum  by  the 
well-ordering  principle.  Thus,  we  can  define  g(n  +  1)  =  minjjr  £  T  '■  x>  g(n)}.  (?) 
By  the  recursion  theorem.  Theorem  18.6,  the  function  g  :  N  — >•  T  is  well-defined. 

In  order  to  prove  that  our  function  g  is  one-to-one  we  first  show  that  if  k  £  N  and 
n  £  Z+,  then  g(k  +  n)  >  g(k).  To  show  this  we  use  induction  on  n.  By  definition, 
g(k+  1)  =  min{x  £  T  :  x  >  g(fc)}  >  g(k).  (?)  This  concludes  the  base  step.  For  the 
induction  step,  we  let  n  £  Z+  and  suppose  that  g(k  +  n)  >  g(k).  Then  g(k  +  n+  1) 
=  min{x  £  T  :  x  >  g(k  +  n )}  >  g(k  +  n).  (?)  By  the  induction  hypothesis  we  get 
g(k  +  n  +  1)  >  g(k).  The  result  follows  by  induction.  It  is  now  immediate  that  if 
j  ^  £,  then  g(  j)  ^  g(£)  (in  Exercise  23.3  part  (c)  you  are  asked  to  provide  the  details 
for  this  claim).  Hence  g  is  one-to-one. 

Finally,  we  show  that  the  function  g  is  onto.  Suppose  to  the  contrary  that  it  is  not. 
Then  the  set  S  =  {x  £  T  :  x  f  ran  (g) }  is  nonempty.  By  the  well-ordering  principle. 
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since  S  is  a  subset  of  N  it  contains  a  least  element.  Call  this  element  u{l),  and  note 
that  a  is  simply  the  smallest  element  in  T  that  is  not  in  the  range  of  g,  and  therefore 
everything  strictly  smaller  than  a  (and  in  T)  is  in  the  range  of  g.  If  a  =  min  T,  then 
a  =  g(0),  which  is  not  possible.  Hence  the  set  {x  £T  :  x  <  a}  ^  0.  This  set  is  also 
bounded  above  (by  a),  and  by  Problem  12.18  it  has  a  maximum,  which  we  call  b.{l) 
Since  b  =  max{x  £  T  :  x  <  a},  we  know  that  b  is  strictly  smaller  than  a  and  so  b 
is  in  the  range  of  g.  Therefore,  there  exists  n  £  N  with  g(n)  =  b.  Now  there  can  be 
nothing  in  T  between  a  and  b,  so  {x  £  T  :  x  >  b}  =  {x  £  T  :  x  >  a},  and  we  have 

g(n  +  1)  =  min{x  £  T  :  x  >  g{n)} 

=  min{x  £  T  :x>  b}  =  min{.r  £  T  :  x  >  a}.(?) 

Therefore  g(n  +  1)  =  a,  contradicting  our  choice  of  a  £  .S',  so  g  is  onto. 

Since  g  is  a  bijection,  T  is  also  countable  if  it  is  infinite.  □ 

As  you  read  the  theorems  and  corollaries  below,  think  about  whether  or  not  you 
know  a  corresponding  result  for  finite  sets.  If  so,  what  was  the  proof?  Do  the  ideas 
from  those  proofs  help  you  here?  Why  or  why  not?  We  leave  the  corollaries  for  you 
to  prove  in  Problems  23.9  and  23.10. 

Corollary  23.4.  Every  subset  of  a  countable  set  is  countable. 

Remember  that  when  we  say  that  a  set  is  countable,  we  mean  that  it  is  finite  or 
countably  infinite.  The  next  exercise  will  often  allow  you  to  handle  both  cases  at 
once. 

Exercise  23.5.  Prove  that  a  nonempty  set  A  is  countable  if  and  only  if  there  exists  a 
one-to-one  function  /  :  A  — >  N.  O 

Theorem  23.6.  Suppose  that  A  and  B  are  countable.  Then  AUB  is  countable. 

Our  proof  begins  with  something  we  have  used  several  times  before. 

Proof.  If  A  C  B  or  B  C  A,  the  result  is  clear.  So  suppose  that  A\B  and  B\A  are  both 
nonempty.  Now  note  that  A  UB  =  A  U  (B\  A)  and  A  (T  (B\  A)  =0.  Since  B\A  QB, 
Corollary  23.4  implies  that  B \ A  is  countable.  Further,  A  and  B\A  are  countable,  so 
by  Exercise  23.5  there  exist  one-to-one  functions  /  and  g  such  that  f  :  A  N  and 
g  :  B\A  — >  N.  Define  H  :  AUB  — >■  N  by 

ffM=|2/W  if  x  £  A 
K  )  \2g(x)  +  l  if x£B\A  ' 

You  can  check  (as  you  have  many  times  before)  that  H  is  well-defined  and  one-to- 
one.  Using  Exercise  23.5  once  again,  we  conclude  that  A  UB  is  countable.  □ 

Corollary  23.7.  The  union  of  finitely  many  countable  sets  is  countable. 
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In  the  next  theorem,  we  want  to  show  that  N  x  N  is  equivalent  to  N.  It  is  oh, 
so  tempting  to  go  to  the  definition  and  try  to  define  a  function  that  is  a  bijection 
from  our  set  onto  N;  after  all,  that  is  the  definition  of  equivalence.  But  if  we  do 
everything  using  definitions,  we  will  not  be  taking  advantage  of  the  useful  body  of 
mathematics  we  have  proved  thus  far,  and  we  will  have  to  re -prove  everything  we 
have  done.  Some  of  it  was  quite  difficult  to  prove!  Life  will  be  much  easier  if  we 
think  about  the  theorems  we  have  proved  already  and  see  when  and  how  we  can  use 
them. 

Theorem  23.8.  The  set  N  x  N  is  countable. 

Proof.  We  show  that  N  x  N  is  countable  by  defining  a  function  /  :NxN-^N 
explicitly.  So,  let  f(n,m)  =  2”3m  for  all  (, n,m )  £  N  x  N.  This  is  clearly  a  well- 
defined  function.  (Note  that  this  function  is  not  onto,  since  a  number  like  7  is  not 
in  the  range.  Therefore,  we  will  not  try  to  show  that  /  is  a  bijection  between  N  x  N 
and  N.)  Since  the  prime  factorization  of  a  natural  number  is  unique,  the  function  is 
one-to-one.  Thus  we  have  a  one-to-one  mapping  /  :  N  X  N  — >  N.  By  Exercise  23.5, 
we  conclude  that  N  x  N  is  countable.  □ 

Though  we  did  not  say  so  explicitly,  N  x  N  is  infinite.  Therefore,  what  we  have 
shown  is  that  N  x  N  is  equivalent  to  N. 

Exercise  23.9.  Let  A  be  a  finite  set  and  let  B  be  a  countable  set.  Prove  that  Ax  B  is 
countable.  Q 


Corollary  23.10.  If  A  and  B  are  countable  sets,  then  Ax  B  is  countable. 

Assuming  you  were  paying  attention  to  all  the  previous  results,  this  will  not  be 
hard  to  prove  (see  Problem  23.1 1). 

We  are  now  ready  for  the  two  main  theorems  of  this  chapter.  After  all  this  work, 
we  can  finally  show  that  the  set  of  rational  numbers  is  countably  infinite. 

Theorem  23.11.  The  set  of  rational  numbers,  Q,  is  countably  infinite. 

What  follows  is  a  natural  way  to  attempt  to  prove  this.  It  is,  unfortunately,  incor¬ 
rect.  But  it’s  worthwhile  to  present  it,  see  what  goes  wrong,  and  fix  it. 

Not  a  proof.  The  rationals  can  be  thought  of  as  p/q  where  p  and  q  are  integers  with 
q^f  0.  Thus,  we  can  define  a  map  from  Q  to  Z  x  (Z\  {0})  by  f{p/q)  =  ( p,q ).  Then 
/  is  bijective,  so  Q  ss  Z  x  (Z\  {0}).  By  Corollary  23.10,  the  latter  set  is  countable. 
Thus  we  conclude  that  Q  is  countable.  H 

As  we  mentioned  (though  not  quite  this  dramatically)  there’s  a  HUGE  error  in 
this  proof.  Find  it,  fix  it,  and  then  read  on  and  see  if  you  really  figured  it  out. 

The  problem  above  was  that  the  function  was  not  well-defined.  So  let’s  try  again. 
In  the  proof  below,  we  begin  by  considering  the  positive  rationals  so  that  we  don’t 
have  to  worry  about  whether  to  put  the  minus  sign  in  the  numerator  or  denominator. 
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Proof.  [Proof;  the  real  thing]  We  will  begin  by  showing  that  Q+  is  countable.  We 
define  / ;  Q+  — >  N  x  N  as  follows.  Write  each  member  of  Q+  as  p/q  where  p,q>  0 
and  p/q  is  in  reduced  form;  that  is,  p  and  q  have  no  positive  common  factor  other 
than  1.  Now  define  f(p/q )  =  ( p,q ).  Because  p/q  is  in  reduced  form,  /  is  well- 
defined  and  one-to-one.  Since  N  x  N  is  countable  (Theorem  23.8),  and  /(Q+)  is  a 
subset  of  it,  we  know  from  Corollary  23.4  that  /(Q+)  is  countable.  We  conclude 
that  Q+  is  countable.  Now  the  set  of  negative  rationals,  Q~,  is  equivalent  to  Q+. 
Since  Q  =  Q+  U  U  {0},  and  we  have  a  finite  union  of  countable  sets,  we  use 
Corollary  23.7  to  conclude  that  Q  is  countable.  Since  Q  is  infinite  we  know  that  it 
is  countably  infinite.  □ 

Looks  like  we  live  in  a  countable  world!  Not  quite — it’s  time  to  give  an  example 
of  an  uncountable  set.  The  next  theorem  will  show  that  the  set  of  real  numbers  is 
uncountable. 

There’s  one  sticky  point  in  our  proof  that  the  reals  are  uncountable.  We  will  use 
the  decimal  representation  of  real  numbers  in  (0,1).  Thus,  a  word  about  decimal 
expansions  is  in  order  here.  Each  element  of  (0,1)  has  a  decimal  representation; 
that  is,  for X  £  (0,1),  there  exist  integers  01,02, . . .  ,a„, . . .  with  0  <  an  <  9  such  that 

x  =  0.a\a2  ...an _ In  Problem  20. 15,  we  showed  that  0.999 . . .  =  1 .000 _ This 

means  that  the  number  1  has  two  representations.  In  fact,  many  numbers  in  (0, 1) 
have  two  decimal  representations.  It  can  be  shown,  however,  that  the  only  time  this 
can  happen  is  when  the  representations  are  of  the  form  0.010203  ■  •  an999 . . .  for 
some  n  €  Z+,  or  0.010203 . .  .o„,1000. . .  for  some  m  £  Z+.  When  given  a  choice, 
we  will  always  choose  the  representation  ending  with  repeated  9’s.  (See  also  Prob¬ 
lem  20.18.) 

Theorem  23.12.  The  set  of  real  numbers,  R,  is  uncountable. 

The  idea  of  this  proof  is  due  to  Georg  Cantor  and  is  called  Cantor’s  diagonal 
argument. 

Proof.  We  will  suppose,  to  the  contrary,  that  R  is  countable  and  see  what  happens. 
Since  every  subset  of  a  countable  set  is  countable,  the  open  interval  (0,1)  must 
be  countable,  too.  Since  (0,1)  is  clearly  infinite,  and  we  have  shown  that  Z+  is 
countably  infinite,  there  exists  a  bijective  function  /  :  Z+  — >  (0,1).  We  will  list  the 
values  of  /  using  the  decimal  expansion  of  each  element  of  (0,1).  So, 

/(l)  =  0.anai2«i3  •  •  • 

/( 2)  =  0.^21^22^23  •  •  • 

/( 3)  =  0. 031032033 . . . 


where  each  represents  an  integer  between  0  and  9.  Since  /  is  onto,  each  number 
in  (0, 1)  appears  in  this  list. 

The  odd  thing  is  this:  we  can  construct  a  number  b  =  0.b\b2 ...  £  (0, 1)  not  in 
this  list  (hence  showing  that  our  function  cannot  possibly  be  onto)  by  describing  its 
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decimal  representation  as  follows.  Look  at  /(l).  If  on  =  2,  let  b\  =  3.  If,  on  the 
other  hand,  a\\  /  2,  define  b\  =  2.  Then  the  first  digits  of  /( 1)  and  h  are  different, 
so  b  is  not  /(l).  For  bi,  if  «22  =  2,  let  bi  =  3.  If,  on  the  other  hand,  <222  7^  2,  define 
Z?2  =  2.  Then  the  second  digits  of  b  and  /( 2)  are  different.  So  b  is  not  /( 2).  Now 
compare  the  element  b  we  have  constructed  with  the  list  below: 

/(l)  =  0.an«i2ai3  . . . 

/( 2)  =  0.(22 1322^23  •  •  • 

/( 3)  =  0.(231(232333  . . . 

f{ll)  =  0.(2/;  ]  (1  ■  ■  ■  3 n n  ■  ■  ■ 


b  =  0.bib2b3 _ 

We  constructed  b  so  that  bn  ^  ann,  and  therefore  b  ^  f(n)  for  every  n.  Then  b  can’t 
be  in  our  list,  which  is  a  bit  bizarre  since  we  claim  to  have  numbered  all  the  elements 
in  (0, 1),  and  b  is  certainly  one  of  the  things  we  numbered.  This  contradiction  must 
mean  that  we  have  assumed  falsely  that  R  is  countable.  □ 

A  word  of  caution:  students  often  forget  that  some  of  the  theorems  proved  in 
previous  chapters  and  some  of  the  definitions  only  apply  to  finite  sets.  Since  the 
notion  of  finite  and  infinite  is  often  counterintuitive,  you  really  must  make  sure  that 
you  check  the  hypotheses  of  the  theorems  you  wish  to  apply  before  you  apply  the 
theorems. 

Reactions  to  Cantor’s  work  in  set  theory  were  mixed  (see,  for  example,  [59,  p. 
1003]).  Leopold  Kronecker  opposed  Cantor’s  theory  and  so  did  Henri  Poincare  (see 
Spotlight:  The  Continuum  Hypothesis  on  page  270).  In  a  discussion  of  Cantor’s 
work,  Poincare  [83]  said,  “For  my  part,  and  I  am  not  alone,  I  think  that  the  important 
thing  is  never  to  introduce  objects  other  than  those  that  can  be  completely  defined 
in  a  finite  number  of  words.”1  Hilbert  and  Russell  praised  Cantor’s  work.  In  his 
memorial  speech  for  Hermann  Minkowski,  Hilbert  points  out  that  Minkowski  was 
the  first  mathematician  of  their  time  who  understood  the  importance  of  Cantor’s 
work.  He  quotes  Minkowski  as  saying,  “History  will  call  Cantor  one  of  the  most 
profound  mathematicians  of  our  time;  it  is  truly  regretful  that  a  very  prominent 
mathematician  [here  Hilbert  tells  us  that  this  mathematician  is  Kronecker]  led  an 
opposition  not  based  entirely  upon  factual  grounds,  which  spoiled  Cantor’s  pleasure 
in  his  scientific  investigations.”1 


Definitions 

Definition  23.1.  An  infinite  set  A  is  said  to  be  countably  infinite  if  A  sa  N. 

1  The  translation  is  ours. 
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Definition  23.2.  A  set  is  countable  if  it  is  either  finite  or  countably  infinite.  A  set  is 
said  to  be  uncountable  if  it  is  not  countable. 


Solutions  to  Exercises 

Solution  (23.1). 

Proof.  Since  A  and  B  are  countably  infinite,  we  know  that  A  «  N  and  B  m  N.  By  the 
transitivity  (and  symmetry)  of  the  relation  ss,  we  may  conclude  that  Ak  B.  By  the 
definition  of  this  equivalence  relation,  there  exists  a  bijective  function  /  mapping  A 
onto  B.  □ 

Solution  (23.3). 

(a)  For  the  demo,  f:T  — *  T  is  defined  by  f(t)  =  min{;r  £  T  :  x>  t}. 

(b)  g(0)  =  2,g(l)  =  3,g(2)  =  5,g(3)  =  7,g(4)  =  ll,g(5)  =  13.  The  subset  of 
T  used  is  {x  £  T  :  x  >  g(5)}  =  {x  £  T  :  x  >  13}  =  {17, 19,23, . . .}.  Thus 
g(6)  =  min{17, 19,23, . . .}  =  17. 

(c)  Suppose  that  j  /  L  We  may  assume  that  j  <  £.  We  showed  that  this  implies 
that  g(j)  <  g(£).  Hence  g(j)  ^  g(£).  This  is  the  contrapositive  of  the  statement 
in  the  question,  and  therefore  the  statement  “if  g(j)  =  g(£),  then  j  =  £”  is  true 
as  well. 

(d)  From  the  list  in  part  (b)  we  deduce  that  19  =  g(l). 

Solution  (23.5).  First  suppose  that  A  is  countable.  If  A  is  finite,  then  since  A^0 
there  exists  an  integer  n  and  a  bijection  /  :  A  — >  {1,2, .. . ,«}.  In  particular,  /  is  a 
one-to-one  mapping  of  A  into  N.  So  we  have  found  our  /,  if  A  is  finite.  If  A  is 
infinite,  then  A  is  countably  infinite.  Therefore,  there  is  a  bijection  / :  A  — >  N.  Thus, 
in  both  cases,  we  have  a  one-to-one  mapping  /  :  A  — >  N. 

Now  suppose  that  we  have  a  one-to-one  mapping  f  of  A  into  N.  Then  /  maps 
A  onto  its  range.  Therefore  A  «  ran(/).  But  ran(/)  is  a  subset  of  N,  and  by  Theo¬ 
rem  23.2  we  know  that  it  must  be  countable.  Thus  A  is  countable,  as  desired. 

Solution  (23.9).  Here’s  one  way  to  prove  this: 

Proof.  If  A  =  0,  then  Ax  B  is  empty  and  we  are  done.  Otherwise  there  exists  a 
positive  integer  n  and  a  bijective  function  /  :  {1,2 A.  Thus  we  may  write 
A  x  B  as  a  union:  AxB  =  U"=i  ({/({)}  x  B).  It  is  easy  to  check  that  for  each  j  the 
function  gj  :  B  ->  {/(;)}  x  B  defined  by  gj(b)  =  (f(j),b)  is  a  bijection.  Therefore, 
{/({)}  x  B  is  countable  for  each  j.  Thus,  we  have  written  A  x  B  as  a  finite  union  of 
countable  sets,  and  by  Corollary  23.7  we  know  that  a  finite  union  of  countable  sets 
is  countable.  Thus  AxB  is  countable.  □ 
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Problems 

Problem  23.1.  Give  an  example,  if  possible,  of  each  of  the  following: 

(a)  a  countably  infinite  collection  of  pairwise  disjoint  finite  sets  whose  union  is 
countably  infinite  (see  Problem  8.18  for  the  definition  of  pairwise  disjoint); 

(b)  a  countably  infinite  collection  of  nonempty  sets  whose  union  is  finite; 

(c)  a  countably  infinite  collection  of  pairwise  disjoint  nonempty  sets  whose  union 
is  finite. 

Problem  23.2.  Which  of  the  following  sets  are  finite?  countably  infinite?  uncount¬ 
able?  (Be  careful — don’t  apply  theorems  for  finite  sets  to  infinite  sets  and  don’t  ap¬ 
ply  theorems  for  countable  sets  to  uncountable  sets!)  Give  reasons  for  your  answers 
for  each  of  the  following: 

(a)  {1  /n  :  n  e  Z\{0}}; 

(b)  R\N; 

(c)  {rgN:  |jt  —  7|  >  |x|}; 

(d)  2Zx3Z. 

Problem  23.3.  For  each  of  the  following  sets  decide  whether  it  is  countable  or  un¬ 
countable  and  justify  your  answer: 

(a)  The  set  of  all  lines  with  rational  slopes; 

(b)  Q\{0}; 

(c)  M\ {1,3}; 

(d)  {(x,y)  £  R  x  R  :  x  +  y  =  1}; 

(e)  [0,°o). 

Problem  23.4.  Is  the  set  of  all  infinite  sequences  of  0’s  and  l’s  finite,  countably 
infinite,  or  uncountable?  Guess  and  then  prove,  please. 

Problem  23.5.  Suppose  that  A  C  B  C  C,  that  the  sets  A  and  C  are  equivalent,  and 
that  C  is  countable.  Is  A  «  B1  Prove  or  give  a  counterexample. 

Problem  23.6.  (a)  Give  an  example  of  two  sets  A  and  B,  such  that  B  C  A  and 

BkA,  but  B  jtzA. 

(b)  Prove  that  if  A  is  a  countably  infinite  set,  then  there  is  always  a  subset  B  of  A 
such  that  B  CA  and  B  ss  A. 

(c)  Prove  that  if  A  is  an  uncountable  set,  then  there  is  always  a  subset  B  of  A  such 
that  B  C  A  and  B  is  also  uncountable. 

(d)  Prove  that  if  A  is  a  finite  set,  B  CA,  and  B  sa  A,  then  B  =  A. 

Problem  23.7.  Prove  that  a  set  A  is  uncountable  if  there  is  an  injective  function 
/:(0,1)-A. 

Problem  23.8.  (a)  Let  X,Y  C  R.  Suppose  that  f  :  X  — >  Y  is  a  function  with  the 

property  that  for  all  x,y  £  X,  if  x  <  y,  then  f(x)  <  f(y).  Prove  that  /  is  one- 
to-one. 
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(b)  Suppose  that  g  :  ^({1,2})  — >  ^({1,2,3})  is  a  function  with  the  property 
that  for  all  A,B  £  ^({1,2}),  if  A  C  B,  then  g(A)  C  g(B).  Is  g  necessarily 
one-to-one?  Prove  it  or  give  an  example  of  g  that  is  not  one-to-one. 

Problem  23.9.  Prove  Corollary  23.4. 

Problem  23.10.  Prove  Corollary  23.7.  To  do  this,  note  that  the  corollary  can  be 
restated  a  bit  more  formally  as  follows.  If  for  some  positive  integer  n,  we  have  n  sets 
A i, . . .  ,A„,  and  each  one  is  countable,  then  (J"=i  A,-  is  countable. 

Problem  23.11.  Prove  Corollary  23.10. 

Problem  23.12.  There  is  another  way  to  show  that  Q  is  countable.  Turn  the  outline 
below  into  a  proof  by  describing  the  counting  process.  (Don’t  try  to  find  a  formula 
for  the  function.) 

Proof.  [Outline  of  proof]  The  proof  is  simplest  if  we  show  that  the  set  of  positive 
rationals,  Q ' ,  is  countably  infinite.  You  showed  in  the  exercises  (and  it  is  easy  to 
see)  that  «  Q+.  Then  Q  =  Q+  UQ_  U  {0}  is  infinite  and  countable,  so  Q  «  N. 
So  we  will  restrict  our  attention  to  Q+.  To  see  that  Q+  is  countable,  we  will  make 
a  chart  of  all  the  fractions  of  the  form  m/n  where  m  and  n  are  positive  integers;  that 
is,  we  consider  the  following  array  of  numbers: 


12  3  4 
2  2  2  2 


12  3  4 
3  3  3  3 


12  3  4 
4  4  4  4 


Try  counting  the  elements  in  the  array  in  an  orderly  fashion.  Make  sure  you  don’t 
count  numbers  twice!  □ 

Problem  23.13.  The  set  Z[\/— 5]  is  a  subset  of  the  complex  numbers  defined  by 
Z[\/— 5]  =  {a  +  y/5 bi :  a,b  £  Z}  C  C,  and  it  is  called  a  quadratic  integer  ring.  Is 
Z [\/  -5]  countable  or  uncountable?  Prove  your  answer. 

Problem  23.14.  Let  n  £  Z+  and  denote  by  N"  =  N  X  N  x  . . .  X  N  {n  times).  Prove 
that  N"  is  countable  for  all  n  £  Z+. 

Problem  23.15.  We  denote  by  N°°  =  {(ai  .aj.u-}-  ■■■)'■  uj  £  N  for  all  j  £  Z+}.  Prove 
that  N°°  is  uncountable. 

Problem  23.16.  Let  X  and  Y  be  two  nonempty  finite  sets  and  denote  by  .^(X .  Y)  the 
set  of  all  functions  / :  X  — >  Y.  Is  f¥{X .  K)  finite,  countably  infinite,  or  uncountable? 
Prove  your  answer. 
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Problem  23.17.  Prove  that  the  set  A  =  [m2  :  m  £  Z}  is  countably  infinite. 

Problem  23.18.  Prove  that  if  A  n  B  =  0,  then  ^(AUfi)  ~  &(A)  x 
Problem  23.19.  Prove  the  following  equivalences: 

(a)  (0,1)*  [0,1)  and 

(b)  (0, 1)  «  (0, 1/2)  U  [1,2). 

Problem  23.20.  Prove  the  following  generalization  of  Exercise  21.12. 

Theorem  23.13.  IfAj  is  finite  for  all  j  £  Z+,  then  U  jeZ+Aj  is  countable. 

Note  that  induction  will  not  work  here.  We  suggest  that  you  adapt  the  ideas  of  the 
alternate  proof  of  Theorem  23.11  outlined  in  Problem  23.12.  As  a  second  note,  we 
mention  that  this  theorem  could  be  generalized  to  allow  the  sets  Aj  to  be  countable. 
However,  a  proof  of  this  has  a  subtlety  and  requires  something  we  have  not  yet 
discussed — something  known  as  the  axiom  of  choice.  See  Project  29.12. 

Problem  23.21.  Prove  that  the  set  of  all  decreasing  functions  from  N  to  N  is  count¬ 
able. 


Chapter  24 

The  Cantor-Schroder-Bernstein  Theorem 


No  one  shall  be  able  to  chase  us  out  of  the  paradise  that  Cantor  created  for  us.1 — David 

Hilbert  [52,  p.  170] 

Suppose  we  have  two  finite  sets.  We  have  developed  enough  machinery  to  tell 
when  one  set  has  more  elements  than  another.  But  what  about  infinite  sets?  For 
example,  we  might  consider  N  and  Z+,  and  we  may  ask  which  one  has  more  el¬ 
ements.  Well,  we  have  already  developed  mathematical  concepts  that  convince  us 
that  these  two  sets  have  the  same  number  of  elements.  In  this  chapter,  we  investigate 
the  situation  for  general  infinite  sets. 

The  next  few  definitions  will  seem  natural,  but  there’s  more  to  them  than  meets 
the  eye!  Given  two  sets  A  and  B,  we  say  that  they  are  of  equal  cardinality,  written 
|A|  =  |ZJ|,  if  the  sets  are  equivalent.  Recall  that  this  means  that  there  is  a  bijective 
function  /  :  A  — ►  B.  For  two  sets,  A  and  B  we  say  that  the  cardinality  of  A  is  less 
than  or  equal  to  the  cardinality  of  B,  written  ,4  <  |Z?|,  if  there  is  an  injection 
/  :  A  — >  B.  If  our  two  given  sets  A  and  B  satisfy  |A|  <  |Z?|  and  |A|  ^  |B|,  then  we 
say  that  the  cardinality  of  A  is  less  than  the  cardinality  of  B  and  we  will  write 
|A|  <  |fi|. 

As  we  mentioned  above,  we  have  already  said  what  it  means  for  two  finite  sets 
A  and  B  to  satisfy  |A|  =  |ZJ|  or  |A|  <  |ZJ|,  because  according  to  Definition  22.1  the 
cardinality  of  a  finite  set  is  a  natural  number  and  we  understand  when  one  natural 
number  is  smaller  than  or  equal  to  another.  Does  our  notion  of  <,  defined  above  for 
general  sets  A  and  B,  agree  with  our  definition  of  <  for  the  cardinality  of  two  finite 
sets?  Of  course  it  does!  You  will  have  the  opportunity  to  show  this  in  Problem  24.6. 

Exercise  24.1.  Explore  the  definitions  above  for  Z  and  R:  Is  |Z|  <  |M|?  Is  |Z|  <  |R|? 

O 

Exercise  24.2.  Explore  the  definitions  above  for  Z  and  ^(Z):  Is  |Z|  <  | ^2>(Z) | ?  Q 


1  The  translation  is  ours. 
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In  the  exercise  above,  we  didn’t  ask  you  whether  jZj  <  \Xg(Z)\.  Before  you  read 
on,  think  about  it.  The  longer  you  think  about  this  question,  the  easier  it  will  be  to 
follow  the  proofs  below. 

We  would  like  our  cardinality  relation  to  have  the  usual  nice  properties  of  a  partial 
order.  The  relation  is  a  partial  order,  but  it’s  going  to  require  some  work  on  our  part 
to  prove  this.  (In  fact,  it  is  a  total  order,  but  you  will  need  to  work  Project  29.12  in 
order  to  see  this!) 

We  have  many  examples  of  finite  sets  and  many  examples  of  countably  infinite 
sets.  We  don’t,  however,  have  a  long  list  of  uncountable  sets  that  are  intrinsically 
different  from  R.  We’ll  fix  that  by  proving  the  theorem  below. 

Theorem  24.3  (Cantor).  For  any  set  X,  we  have  |X|  <  \XX(X)\. 

The  main  part  of  the  proof  will  be  to  show  that  there  is  no  bijection  between 
X  and  XX  (X).  Before  we  turn  to  this,  let’s  think  about  whether  or  not  we’ve  seen 
something  like  this  before.  Have  we  seen  a  proof  in  which  we  showed  that  there 
is  no  bijection  between  two  infinite  sets?  Absolutely:  When  we  showed  that  R  is 
uncountable,  we  used  Cantor's  diagonal  argument.  Take  a  moment  to  look  back  at 
the  proof  of  Theorem  23.1 1  and  try  to  adapt  that  proof  to  the  theorem  below.  This 
is  exactly  what  Cantor  did. 

Proof.  The  function  /  :  X  — >  fX{X)  defined  by  f(x)  =  { w }  is  clearly  an  injection, 
so  we  have  |X|  <  \Xg(X)\. 

Suppose,  to  the  contrary,  that  \X  \  =  \,<X(X)\.  By  definition,  there  is  a  bijection 
g  :  X  — >  XX (X).  In  particular,  g  is  surjective.  (Note  that  g  maps  elements  in  X  to 
sets,  so  g(x)  will  be  a  set.)  Consider  the  set  A  =  {x  G  X  :x^  g(jc)}.  Since  A  G  XX  (X) 
and  we  suppose  g  to  be  onto,  there  exists  a  G  X  with  g(a)  =  A.  Is  a  G  A?  If  it  is, 
then  a  G  A  =  g(a).  So  a  G  g(a)  and  if  we  look  back  at  the  definition  of  A,  we  see 
that  a  ^  A,  a  contradiction.  The  only  other  possibility  is  that  a  (f  A  =  g(a).  But  let’s 
return  to  the  definition  of  A:  Since  a  g(a ),  we  see  that  a  G  A!  This  is  again  a 
contradiction,  and  we  conclude  that  there  is  no  such  bijection  g.  □ 

Choosing  various  infinite  sets  for  X,  we  obtain  different  examples  of  uncountable 
sets. 

Corollary  24.4.  The  set  AXifi)  is  uncountable. 

The  cardinality  of  XX (X )  depends  only  on  the  cardinality  of  X  (and  you’ll  explain 
why  this  is  the  case  in  the  next  exercise).  In  particular,  this  implies  that  |^(N)| 
=  |  £X(Z)  |  =  |  X^(Q)  |,  to  compare  just  a  few  of  our  favorite  sets. 

Exercise  24.5.  Show  that  if  for  two  sets  X  and  Y  we  have  A  =  |F|,  then  \XX(X)\ 

=  \&’(Y)\-  O 

How  do  the  two  uncountable  sets  R  and  ^(N)  compare?  We  will  be  able  to  show 
that,  in  fact,  ,S^(N)  <  |R|,  and  |R|  <  |^2(N)|.  Now  this  notation  certainly  suggests 
that  we  can  conclude  that  the  two  cardinalities  are  the  same,  but  we  need  to  be  very 
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careful  here — we  are  not  comparing  natural  numbers.  We  are  saying  that  we  have 
an  injection  from  ^(N)  to  R.  and  an  injection  from  R  to  ^(N).  To  conclude  that 
the  two  cardinalities  are  equal,  we’ll  need  to  construct  a  bijection  out  of  the  two 
injections.  This  is  the  content  of  the  following  very  important  theorem. 

Theorem  24.6  (Cantor-Schroder-Bernstein).  If  A  and  B  are  two  sets  such  that 
|A|  <  |B|  and  |Z?|  <  |A|,  then  |A|  =  |B|. 

Our  basic  proof  strategy  will  be  as  follows:  Our  assumption  implies  that  we  can 
start  with  two  injections,  /  :  A  — >  B  and  g  :  B  — >  A.  Now  we  partition  each  of  the 
sets  A  and  B  into  the  same  (finite)  number  of  sets  (it  will  be  three  in  our  case). 
Then  we  will  pair  up  the  sets  in  the  partitions  in  such  a  way  that  the  restriction 
of  the  appropriate  injection  is  actually  a  bijection.  Finding  just  the  right  kind  of 
partition  of  each  set  is,  in  fact,  ingenious  and  it  is  the  reason  why  the  names  of  three 
major  mathematicians  are  associated  with  this  theorem.  Two  different  proofs  are 
outlined  in  Problems  24.13  and  24.14.  The  first  of  these  proofs  is  shorter  than  the 
one  we  give  here.  We  have  chosen  the  proof  below,  because  we  find  it  particularly 
clear  and  enlightening.  We  hope  you  feel  the  same  way  about  it.  It  is  adapted  from 
Halmos  [41], 

Proof.  Since  | A |  <  \B\,  there  is  an  injection  /  :  A  — >  B  and  since  \B\  <  |A|,  there  is 
an  injection  g  :  B  — » A. 

We  will  define  three  subsets  A„,A/,,  and  A^  of  A.  An  element  of  A  will  belong  to 
exactly  one  of  these  subsets  depending  on  something  we  will  call  its  length,  a  notion 
we  now  define. 

First  note  that  since  g  is  injective  we  have  that  for  any  z  £  A,  the  set  g_1({z}) 
is  either  empty  or  a  set  with  exactly  one  element.  Likewise,  for  any  z  &  B  the  set 
/“ 1  ({z})  is  either  empty  or  contains  exactly  one  element.  This  allows  us  to  define 
for  each  rgA  a  sequence  (x„)  with  terms  from  A  U  B  as  follows.  We  take 


XQ  =  x. 


If,  in  the  following,  the  set  on  the  right  is  not  empty,  then  we  define  x„  as  indicated 
below: 


xt  €  ^  1  ({xro}) 
*2  €  ./"’({  vi}) 
x3  €  g_1({x2}) 


If,  at  some  point,  the  set  on  the  right-hand  side  is  empty,  then  the  sequence  stops.  In 
order  to  visualize  this,  consider  the  diagram  below  beginning  on  the  right: 


8  f 


8 
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If  k  is  the  first  positive  integer  for  which  the  inverse  image  of  {x&_  i }  is  empty, 
then  the  sequence  is  finite  (the  diagram  above  stops)  with  k  terms.  Note  that  for 
every  x  g  A,  the  sequence  (x„)  is  of  length  at  least  one,  though  it  may  be  finite  or 
infinite. 

We  now  define  the  three  promised  sets,  Aa.Ah-  and  Arx,: 

Aa  =  {x  g  A  :  (xn)  has  an  odd  number  of  terms}, 

Ab  =  {x  g  A  :  (xn)  has  an  even  number  of  terms}, 

Aoo  =  {xeA:  (x„)  is  an  infinite  sequence}. 

Evidently,  A  =  Aa  U  A  /,  LJ  A  «,  and  the  three  sets  are  pairwise  disjoint. 

For  each  y  g  B.  we  define  a  sequence  (yn)  with  terms  i  11 ,4  U  B  analogously.  We 
take 


To  =  y- 

If,  in  the  following,  the  set  on  the  right  is  not  empty,  then  we  define  yn  as  indicated 
below: 


yi  €  f~\{yo}) 

T2  e  ^  1 ({ni}) 

T3  e  r\{y2}) 


If,  at  some  point,  the  set  on  the  right-hand  side  is  empty,  then  the  sequence  stops.  In 
order  to  visualize  this,  consider  the  diagram  below  beginning  on  the  right: 

f  8  f 

•  ••b  *yi  *y\  >  yo =  y- 

If  k  is  the  first  positive  integer  for  which  the  inverse  image  of  {y^_  i }  is  empty, 
then  the  sequence  is  finite  with  k  terms.  Note  that  for  any  y  g  B,  the  sequence  (y„) 
is  of  length  at  least  one,  though  it  may  be  finite  or  infinite.  We  then  define 

Bj,  =  {y  g  B  :  (y„ )  has  an  odd  number  of  terms}, 

Ba  =  {y  g  B  :  (y„)  has  an  even  number  of  terms}, 

Boo  =  {y  g  B  :  (y„)  is  an  infinite  sequence}. 

Evidently,  B  =  Bb  U  Ba  U  Boo  and  the  three  sets  are  pairwise  disjoint.  (Our  notation 
suggests  that  we  will  pair  Aa  and  Ba,  Ab  and  Bb,  and  A„  and  Boo.) 

Note  that  the  sequences  corresponding  to  x  g  A  and  y  g  B  are  defined  according 
to  the  same  rule  and  in  such  a  way  that  every  term  depends  only  on  the  term  before. 
Thus,  if  two  sequences,  one  corresponding  to  x  g  A  and  the  other  to  y  g  B,  take  on 
the  same  value  at  a  term  with  an  even  index  for  (x„)  and  a  term  with  an  odd  index 
for  (y„)  or  vice  versa,  then  the  tails  of  the  two  sequences  are  the  same  from  there 
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on.  (If  at  this  point  you  are  confused  about  the  definition  of  these  two  sequences, 
we  recommend  taking  a  short  break  from  the  proof  and  working  Exercises  24.7  and 
24.8  immediately  following  the  proof.  After  working  the  exercises,  you  should  find 
the  proof  easier  to  read.) 

We  will  now  compare  A„  with  Ba,  A/,  with  /!/,,  and  A„  with  B,„.  We  will  need  to 
handle  the  case  in  which  Aa  =  0  separately  from  the  case  in  which  it  is  nonempty. 

Claim  1 .  If  Aa  /  0,  then  /  |Aa  :  Aa  — >  Ba  is  a  bijection. 

Let  xgA.  Then  y  =  f{x)  £  B.  Let’s  compare  the  two  sequences,  (x„)  and  (y„): 
The  sequence  corresponding  to  y  is  (y„)  =  (y,  x,  V2-  ■  . .),  while  the  sequence  cor¬ 
responding  to  x  is  (x,  xi,  . . .).  Once  x  appears,  the  subsequent  terms  will  all  be  the 
same.  In  other  words,  and  this  will  be  crucial  in  what  follows,  Xk  =  )!k-\  i  for  k  >  0. 
Furthermore,  it  will  be  helpful  to  keep  in  mind  that  (x„)  has  even,  odd,  or  infinite 
length  if  and  only  if  (y„)  has  odd,  even,  or  infinite  length,  respectively. 

First  we  check  that  /  |A(I  maps  Aa  onto  Ba.  To  this  end,  we  recall  that  if  x  £  Aa, 
then  the  sequence  (x„)  is  of  odd  length.  From  the  discussion  in  the  last  paragraph, 
the  sequence  (y„)  is  of  even  length;  that  is,  y  £Ba.  This  implies  that  ran(/  |Ao  )  C  Ba. 
For  the  reverse  inclusion,  if  y  £  Ba,  then  the  sequence  (y„)  is  of  even  length  and  has 
at  least  two  terms  (y,  y\ ,  . . .).  We  set  x  =  y\  and  note  that  y  =  /(x).  As  above,  the 
sequence  of  x  satisfies  Xk  =  y^+i  for  k>  0.  Since  (yn)  is  of  even  length,  (x„)  is  of 
odd  length  and  thus  x  £  Aa.  This  shows  that  Ba  Cran(/|AJ. 

Thus,  we  now  know  that  /  |An  :  Aa  — >  Ba  is  well-defined  and  surjective.  Since 
f\Aa  is  the  restriction  of  an  injection,  it  is  also  one-to-one,  establishing  the  first 
claim. 

Claim  2.  If  Aa  =  0,  then  Ba  =  0. 

Suppose,  to  the  contrary,  that  Ba  ^  0.  Then  there  is  y  £  B  such  that  the  sequence 
(yn)  is  of  even  length  and,  in  particular,  of  length  at  least  two.  Then  x  =  y  i  £  A  and 
the  sequence  (x„)  satisfies  x*  =  yk  -•  i  for  all  k  >  0.  In  particular,  (x„)  is  of  odd  length 
and  thus  Aa  /  0.  This  contradiction  establishes  the  second  claim. 

These  results  imply  that  Aa  «  Ba.  In  entirely  analogous  ways  we  can  show  that 
g\sb  ■  By  — » Ay  and  /  |Aoo  :  Aoo  — »  Bx  are  bijections.  Hence  we  also  have  A b  ~  Bb 
and  Aoo «  Boo. 

By  Theorem  21.6,  we  have  A  =  AaUA^UAoo  ~  BbUBa  UTLo  =  B.  Thus  |  A  |  =  |B|. 


□ 


Exercise  24.7.  Since  |N|  <  \Z\  and  \Z\  <  |N|,  there  exist  two  injective  functions 
/ :  N  — >  Z  and  g  :  Z  — >  N.  We  will  make  the  following  choices  for  /  and  g: 


f{x)  =  2x  and 


In  what  follows,  we  refer  to  the  notation  from  the  proof  of  Theorem  24.6. 
(a)  Find  the  three  sequences  defined  for  0, 3,  and  8  in  N. 
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(b)  Find  the  three  sequences  defined  for  0,  —5,  and  10  in  Z.  O 

Exercise  24.8.  Continuing  the  exercise  above  and  using  the  notation  from  the  proof 
of  Theorem  24.6,  describe  the  set  N*.  O 


Exercise  24.9.  Using  the  notation  of  the  proof  above,  show  that  A,„  ~  B„.  O 

It’s  now  possible  to  show,  as  we  ask  you  to  in  Problem  24.4,  that  the  relation  <, 
as  defined  in  this  chapter, “almost”  satisfies  the  conditions  of  a  partial  order;  that  is, 
it  is  reflexive,  transitive,  and  “almost”  antisymmetric.  Choosing  the  proper  set  on 
which  to  define  the  relation,  we  can  make  it  a  true  partial  order  and,  in  fact,  a  total 
order.  More  on  that  appears  in  Project  29.12. 

We  will  ask  you  to  show,  in  the  exercise  below,  that  there  are  infinitely  many 
uncountable  sets  all  of  different  cardinality.  So  there  are  not  merely  two  different 
infinities — there  are  at  least  countably  infinitely  many  of  them.  At  the  time  of  Can¬ 
tor’s  construction  this  theory  was  not  easy  for  people  to  accept.  To  learn  more  about 
this,  see  the  spotlight  on  the  continuum  hypothesis. 

Exercise  24.10.  Using  Theorems  24.3  and  24.6,  show  that  there  are  infinitely  many 
uncountable  and  (mutually)  nonequivalent  sets.  O 


We  finish  the  chapter  by  following  through  on  our  vow  to  compare  the  sizes  of 
our  “original”  uncountable  sets,  R  and  ^(N). 

Theorem  24.11.  The  sets  R  and  &(N)  have  the  same  cardinality. 

Proof.  We  define  a  function  /  :  R  — >  ^(Q)  by  f{x )  =  {y  £  Q  :  y  <  a}.  For  each 
x  £  R,  the  set  f{x)  is  uniquely  determined  and  f{x)  C  Q.  Thus  /  is  a  well-defined 
function. 

We  will  show  that  /  is  one-to-one.  Let  x,y  £  R  with  a  f  y.  Without  loss  of  gen¬ 
erality,  we  may  assume  that  x  <}•  By  Theorem  12.12,  there  is  a  rational  number  c 
with  x  <  c  <y.  But  c  qL  f(x)  while  c  £  /(y),  so  f{x)  ^  /(y).  This  shows  that  /  is 
injective. 

Using  the  result  of  Exercise  24.5  we  conclude  that  |R|  <  |52(Q)|  =  |^0(N)|. 

For  the  other  direction  we  define  a  function  g  :  /^(W)  — >  R  by 


<?(A)  = 


where  aj  = 


1  if  j  £  A 
Oif  j#A  ' 


For  each  A  £  ^(N),  we  have  a  unique  series.  Using  the  comparison  test  (com¬ 
paring  to  the  convergent  geometric  series  LJ=0  Jf)  we  see  that  LJ=o  ff  converges. 
Consequently,  g(A)  £  R  and  is  unique.  Hence  the  function  g  is  well-defined. 

To  show  that  g  is  one-to-one,  consider  A,B  £  ^(N)  with  A^f  B.  By  the  well¬ 
ordering  principle  of  N,  there  is  a  smallest  integer  k  that  belongs  to  one  of  the  two 
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sets  but  not  the  other.  Without  loss  of  generality  we  may  assume  that  k  £  A  and 
k  B.  Using  the  notation  g(A)  =  £“=0  and  g(B)  =  £7=0  we  note  that  since  k 
is  the  smallest  integer  in  A  \ B,  we  have  a;  =  b ;  for  j  =  0, 1, . . . , k  —  1 .  Now  we  use 
this  and  properties  of  geometric  series  to  calculate: 


m  =  b‘ 


7=o 


37 


_  J_  _ ^7 

J  j=k+ l  J 

>4-  £  - 

-  3*  .  4-  ,  37 


7=*+l 

1  1 


>0. 


3*  2(3*) 

Thus  g(A)  ^  g(B)  and  g  is  an  injection.  This  implies  that  |7?2(N)|  < 
We  apply  Theorem  24.6  to  finish  the  proof. 


In  this  text,  we  have  defined  what  it  means  for  the  cardinality  of  two  infinite  sets, 
A  and  B,  to  be  less  than  or  equal  to  and  equal  to.  It  is  also  possible  (with  significantly 
more  effort)  to  define  the  cardinal  number  of  an  arbitrary  set.  When  this  is  done,  it  is 
common  to  introduce  special  symbols  for  the  cardinality  of  R  and  IT  The  cardinal 
number  |R|  is  denoted  by  c  and  is  called  the  continuum,  while  |N|  is  denoted  by 
No  (where  N  is  the  Hebrew  letter  aleph).  The  symbol  No  is  read  “aleph-nought.” 
If  |Xj  =  a,  we  write  \  &{X)\  =  2“.  In  Problem  22.15  you  showed  that  this  notation 
agrees  with  the  usual  one  for  finite  sets  (see  Definition  22.1  for  that  definition  of 
cardinal  number).  Problem  24.18  provides  the  motivation  for  this  power  notation. 
Using  this  notation,  we  see  that  another  way  to  state  Theorem  24.11  is  that  c  =  2K°. 
The  sets  R  and  ^(N)  are  the  smallest  uncountable  sets  that  we  introduced  in  this 
text.  Are  they  the  smallest  ones?  Some  people  believe  that  Cantor  drove  himself  into 
madness  by  his  repeated  attempts  to  prove  that  this  is  indeed  the  case  (see  [67],  for 
example).  His  conjecture,  that  there  is  no  uncountable  set  of  cardinality  strictly  less 
than  c,  is  called  the  continuum  hypothesis.  A  discussion  of  this  topic  is  in  Spotlight: 
The  Continuum  Hypothesis. 


Definitions 

Definition  24.1.  For  two  sets  A  and  B  the  cardinality  of  A  is  equal  to  the  cardi¬ 
nality  of  B,  written  |A  |  =  \B\,  if  there  is  a  bijective  function  /  :  A  — >  B. 

Definition  24.2.  For  two  sets  A  and  B  the  cardinality  of  A  is  less  than  or  equal  to 
the  cardinality  of  B,  written  |A|  <  \B\,  if  there  is  an  injective  function  /  :  A  — »  B. 
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If,  in  addition,  |A  ^  \B\,  then  we  say  the  cardinality  of  A  is  less  than  the  cardinality 
of  B  and  we  write  |A|  <  \B\. 


Solutions  to  Exercises 

Solution  (24.1).  Let  /  :  Z  — >  R.  be  defined  by  f(n)  =  n.  Then  /  is  obviously  an 
injection,  so  |Z|  <  |M|.  We  know  from  Theorem  23.11  that  |Z|  ^  |K|,  therefore 
|Z|<|K|. 

Solution  (24.2).  We  claim  that  |Z|  <  |^a(Z)|.  To  see  this,  define  /  :  Z  — >■  ^(Z)  by 
f(n)  =  {«}.  Then /is  an  injection  and  therefore  |Z|  <  |^(Z)|. 

Solution  (24.5).  Since  X  =  |F|,  there  is  a  bijection  / :  X  — »  T.  We  define  a  func¬ 
tion  F  :  &>(X)  — >  &(Y),  by  F(A)  =  {/(a)  :  a  G  A}  for  A  G  ^(X). 

Since  /(a)  G  T  for  all  a  GX,  we  have  for  each  A  C  X  a  unique  F(A)  C  T  and  the 
function  F  is  well-defined. 

Suppose  that  for  A,B  G  ^(X)  we  have  F(A)  =  F(Z?).  If  z  G  A,  then  /(z)  G  F(A) 
and  thus  /(z)  G  F(5).  That  is,  /(z)  =  f(w)  for  some  w  G  B.  Since  /  is  one-to-one, 
z  =  w  and  z  G  B.  Thus  A  C  B.  Entirely  analogously  one  shows  that  B  C  A.  Hence 
A  =  B  and  the  function  F  is  one-to-one. 

Let  B  G  &(Y).  We  define  A  =  {/-1(jc)  :  x  G  5}.  Note  that  /~*  :  T  — >  X  exists 
since  /  is  a  bijection.  We  claim  that  F(A)  =  B.  If  x  G  B,  then  /  1  (x)  G  A  and  thus 
x  =  /(Z*1  (x))  G  F(A).  For  the  converse,  if  x  G  F(A),  then  x  =  /(y)  for  some  y  G  A. 
Then  y  =  /  1  (z)  for  some  z  G  B.  Thus  x  =  /(y)  =  z  and  x  G  B.  This  establishes  the 
claim  and  shows  that  the  function  F  is  onto. 

We  conclude  that  our  function  F  :  £?(X)  — »  ( Y )  is  a  bijection  and  hence 

|^(X)|  =  |^(F)|. 

Solution  (24.7). 

(a)  We  have  (0„)  =  (0,  0,  0,  ...),  (3„)  =  (3,  -2),  and  (8„)  =  (8,  4,  2,  1).  A 
brief  explanation  of  some  of  these  might  help:  For  0,  we  start  with  0.  Then 
we  ask  what  was  mapped  to  0  under  g  (the  answer  is  0).  Then  we  ask  what 
was  mapped  to  0  under  /.  Again,  the  answer  is  zero.  For  3,  we  start  with  3 
and  ask  what  g  mapped  to  3.  The  answer  is  —2.  Then  we  ask  what  /  mapped 
to  —2.  The  answer  is  that  nothing  was  mapped  to  —2,  and  that  is  why  this 
sequence  ends. 

(b)  We  have  (0„)  =  (0,  0,  0,  ...),  (-5,,)  =  (-5),  and  (10m)  =  (10,  5,  -3). 

Solution  (24.8).  We  claim  that  N*  =  Z+.  In  Exercise  24.7,  we  saw  that  (0„)  is 
infinite.  Hence  0  ^  and  therefore  N*  C  Z+. 

Conversely,  suppose  that  x  G  Z+.  We  first  consider  the  case  when  x  is  odd.  Then 
(x„)  =  (x,  —  (x+  l)/(2))  andx  G  N*.  Now  we  break  the  case  in  which  x  is  even  into 
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two  subcases.  Note  first  that  every  even  z  £  Z+  can  be  written  as  2 kw,  where  w  is 
odd  and  A:  is  a  positive  integer.  Let  x  £  Z+. 

If  x  =  2 ky,  where  k  >  0  is  even  and  y  is  odd,  then 

(x„)  =  (x,2k~ly,  ...,  2 y,y, 

Thus,  (x„)  is  of  even  length  and  x  £  N^. 

If  x  =  2ky,  where  k  >  0  is  odd  and  y  is  odd,  then 

(x„)  =  0,  2 k~ly,  ...,  2 y,  y). 

This  sequence  is  again  of  even  length  and  hence  x  £  N /, .  Thus  Z+  C  N h  and  the 
claim  is  established. 

Solution  (24.9).  Claim  1 .  If  A<*,  ^  0,  then  /  \a„,  :  Ar„  — >  B„  is  a  bijection. 

Let  x  £A.  Then  y  =  f{x)  £  B  with  corresponding  sequence  ( yn )  =  (y,  x,  y2,  •  •  •) 
while  the  sequence  of  x  is  (x,  xi,  . . .)  and  thus  Xk  =  yk-  \  for  k>  0.  Therefore,  if 
x  £  A0 o,  then  the  sequence  (x„ )  is  infinite  and  hence  (y„ ')  is  also,  that  is,  y  £  B„,.  This 
implies  that  ran(/ )  C  Boo.  Ify  £  Boo,  then  the  sequence  (y„)  is  infinite:  (y,  yi ,  . . .). 
We  set  x  =  yi  and  note  that  y  =  /(x).  As  above  the  sequence  of  x  satisfies  Xk  =  yk+ 1 
for  A:  >  0.  Since  (y„)  is  an  infinite  sequence,  (x;!)  is  also  and  thus  x  £  Aoo.  This  shows 
that  Boo  C  ran(/ ). 

We  have  now  shown  that  /  :  A„  — >  Br,,  is  well-defined  and  surjective.  Since 

/| a„  is  the  restriction  of  an  injection,  it  is  also  one-to-one.  This  establishes  the  first 
claim. 

Claim  2.  If  Aoo  =  0,  then  Boo  =  0. 

If  Boo  i=-  0,  then  there  exists  y  £  B  such  that  the  sequence  (y„)  is  infinite.  Then 
x  =  yi  £  A  and  the  sequence  (x„)  satisfies  x/.  =  yk+\  for  all  A:  >  0.  In  particular,  (x„) 
is  infinite  and  thus  Aoo  /  0.  This  establishes  the  second  claim. 

The  two  claims  together  show  that  Aoo  «  Boo. 

Solution  (24.10).  We  will  define  recursively  a  sequence  of  sets  as  follows: 

Xo  =  N  and  Xn+i  =  £P(Xn)  for  n  >  0. 

It  is  clear  that  all  sets  are  infinite,  but  suppose  that  not  all  of  them  have  different  car¬ 
dinality.  Then  there  are  natural  numbers  j  and  k  with  j  <  k  such  that  | Xj  \  =  A/.|.  In 
Problem  24.4  you  will  show  thatforall  sets  A, B,C  with  |A|  <  |Bj  and  \B\  <  |C|,  you 
can  conclude  that  |A|  <  |C|.  Using  this  fact  repeatedly  together  with  Theorem  24.3 
we  get  \Xj+\ |  <  \Xk\  =  \Xj\.  Theorem  24.3  also  implies  that  \Xj\  <  \  £?(Xj)  \  =  \Xj+  \  |. 
By  Theorem  24.6  we  conclude  that  |X7j  =  Xj .  1 1 .  This  contradicts  Theorem  24.3  and 
shows  that  no  two  sets  have  the  same  cardinality  and  all  but  Xq  are  uncountable. 
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Spotlight:  The  Continuum  Hypothesis 

The  origin  of  set  theory  is  generally  attributed  to  Georg  Cantor  and  Richard 
Dedekind.  In  1872,  the  two  men  met  while  on  vacation  in  Interlaken,  Switzerland. 
The  result  of  the  meeting  was  a  long-lasting  correspondence  in  which  many  of  the 
new  ideas  were  discussed.  Cantor  was  somewhat  isolated  at  the  University  of  Halle 
(his  place  of  employment)  and  this  long-distance  collaboration  was  a  great  help  to 
him.  In  an  attempt  to  understand  the  infinite  as  a  workable  concept.  Cantor  arrived 
at  a  hierarchy  of  infinities.  His  work  was  understood  by  few  and,  in  fact,  it  was 
opposed  by  many  of  the  leading  mathematicians  at  the  time.  In  particular,  a  bitter 
antagonism  between  Cantor  and  his  former  teacher,  Leopold  Kronecker,  developed. 
In  addition,  in  philosophy  of  religion  some  people  understood  Cantor’s  construction 
of  many  infinities  as  a  threat  to  monotheism:  one  god — the  infinite  was  jeopardized 
and  the  path  to  polytheism  opened. 

The  concept  of  many  infinities  led  to  questions  within  the  mathematical  theory  of 
cardinality  of  sets.  Cantor  wondered  whether  the  set  of  real  numbers  was  of  small¬ 
est  possible  size  of  uncountable  infinity;  that  is,  is  there  an  uncountable  set  that  has 
“smaller  size”  than  the  real  numbers?  Cantor’s  conjecture  is  known  as  the  Contin¬ 
uum  Hypothesis: 

There  is  no  set  A  such  that  |N|  <  |X|  <  |R|. 

Cantor’s  attempts  to  prove  or  disprove  this  hypothesis  coincided  with  his  first 
recorded  attack  of  depression.  He  suffered  from  a  bipolar  disease  that  resulted  in 
manic  depression,  periods  of  stays  in  an  insane  asylum,  and  the  inability  to  work 
on  mathematics.  In  between  he  worked  on  the  continuum  hypothesis,  fearful  that  he 
would  never  be  able  to  answer  the  question  posed.  He  announced  a  proof,  only  to 
retract  it  a  few  months  later.  Then  he  announced  a  refutation,  but  had  to  concede  the 
incompleteness  of  his  arguments  for  that  as  well. 

The  continuum  hypothesis  survived  the  nineteenth  century  unresolved.  David 
Hilbert,  in  his  famous  address  to  the  International  Congress  of  Mathematicians  in 
Paris  in  1900,  gave  a  list  of  mathematical  problems  [50]  that  were  of  foremost  inter¬ 
est  to  the  discipline.  (See  also  Spotlight:  Hilbert’s  Seventh  Problem  on  page  359.) 
The  very  first  problem  he  stated  was  the  continuum  hypothesis.  This  undoubtedly 
magnified  the  number  of  mathematicians  attempting  to  conquer  the  hypothesis.  And 
what  happened  to  Cantor  during  this  time?  Though  Cantor  would  live  for  17  more 
years,  teach  courses,  and  give  a  few  lectures  on  the  paradoxes  that  his  theory  cre¬ 
ated,  he  also  spent  the  rest  of  his  life  fighting  his  mental  illness.  Sadly,  Hilbert’s 
quote  (stated  on  page  261),  showing  the  mathematical  community’s  acceptance  and 
even  admiration  of  Cantor’s  work,  did  not  appear  until  eight  years  after  his  death. 

And  what  about  the  continuum  hypothesis  (or  CH  as  it  is  often  called)?  Before 
proving  or  rejecting  CH,  a  clear  statement  of  what  is  allowed  in  such  a  proof  or 
refutation  had  to  be  established.  By  1930,  the  framework  that  was  quite  universally 
accepted  was  the  axiomatic  system  of  Ernst  Zermelo  and  Abraham  Fraenkel  (ZF) 
together  with  the  axiom  of  choice  (AC)  (see  the  Appendix  for  the  exact  statements). 
We’ll  refer  to  this  system  as  ZFC  for  short.  A  resolution  of  the  CH  question  came  in 
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two  stages  and  in  a  surprising  way:  In  1940  Kurt  Godel  proved  that  if  ZFC  does  not 
contain  a  contradiction — that  is,  it  is  consistent — then  so  are  ZFC  and  CH.  Twenty 
years  later  Paul  Cohen  showed  that  if  ZFC  is  consistent,  so  are  ZFC  and  -CH.  These 
two  major  mathematical  results  of  the  twentieth  century  ended  efforts  to  decide 
about  the  continuum  hypothesis;  the  question  of  whether  CH  is  true  or  false  cannot 
be  answered  within  ZFC. 

For  an  enjoyable  dramatization  of  the  heroic  battles  of  Cantor  and  Godel  (to¬ 
gether  with  Boltzman  and  Turing)  we  recommend  the  BBC  documentary  Dangerous 
Knowledge  [67]. 

The  Clay  Institute’s  millennium  questions,  a  centennial  update  of  Hilbert’s  prob¬ 
lems,  no  longer  contain  the  continuum  hypothesis.  So  is  the  subject  dead?  Not  at 
all.  Both  proofs  about  the  consistency  of  CH  and  -CH  with  ZFC  brought  with  them 
completely  new  techniques.  In  the  first  case,  “Godel  numbering”  was  introduced. 
In  the  second  case,  the  technique  of  “forcing”  was  developed.  This  technique  can 
be  thought  of  as  a  way  to  introduce  additional  sets  without  losing  control  over  the 
axioms.  This  has  turned  out  to  be  a  major  tool  for  proving  consistency  and  inde¬ 
pendence  results.  With  this  as  well  as  other  techniques,  set  theory  has  developed 
tremendously  in  the  last  century.  The  demands  of  other  fields  of  mathematics  led  set 
theorists  like  W.  Hugh  Woodin,  Ian  Mycielski,  and  many  others  to  look  for  more  ax¬ 
ioms.  Many  of  the  ZFC  axioms  postulate  ways  to  create  new  sets.  These  axioms  can 
be  extended  with  large  cardinal  axioms,  or  ways  to  produce  bigger  sets.  Another 
strategy  is  to  bring  the  following  idea  into  the  picture:  Let  A  C  R.  In  a  countably 
infinite  game,  two  players  take  turns  in  choosing  decimals  of  a  real  number.  If  at  the 
end  the  number  is  in  A,  player  I  wins,  otherwise  player  II  wins.  If,  for  either  player, 
a  winning  strategy  exists,  then  the  set  A  is  called  determined.  The  axiom  of  determi- 
nacy  declares  all  subsets  of  ffi.  to  be  determined.  However,  this  axiom  contradicts  AC 
and  therefore  cannot  be  used  together  with  ZFC  in  order  to  decide  on  the  validity  of 
CH. 

Perhaps  the  most  promising  attempts  to  expand  ZFC  are  due  to  Saharon  Shelah. 
He  worked  with  a  new  axiom,  called  the  proper  forcing  axiom  (due  to  Jim  Baum¬ 
gartner).  Stevo  Todorcevic  showed  that  ZFC  together  with  the  proper  forcing  axiom 
imply  that  the  CH  fails  and,  in  fact,  that  there  is  exactly  one  cardinal  number  strictly 
between  Ko  and  the  continuum  c. 

However,  the  good  thing  about  ZFC  is  that  all  mathematicians  understand  the 
axioms  and  most  mathematicians  accept  them,  even  the  axiom  of  choice.  The  newer 
axioms  are  neither  widely  understood  nor  universally  accepted.  Finally,  there  is 
Godel’ s  First  Incompleteness  Theorem ,  which  implies  that  set  theorists  cannot  make 
every  sentence  provable.  But  much  remains  to  be  understood:  Reverse  mathematics, 
for  example,  is  a  program  that  takes  mathematical  results  and  tries  to  understand  the 
axioms  necessary  to  prove  them.  Thus,  we  can  expect  many  interesting  discoveries 
in  the  future.  (A  summary  of  the  history  of  set  theory  through  the  present  for  the 
advanced  reader  is  in  [31,  pp.  1-92].) 
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Problems 

Problem  24.1.  Let  A  and  B  be  subsets  of  a  set  A. 

(a)  Prove  that  \A  x  B\  =  \B  x  A\. 

(b)  Prove  that  if  B  ^  0,  then  |A|  <  \A  X  B\. 

Problem  24.2.  Prove  each  of  the  following,  by  defining  an  appropriate  function  be¬ 
tween  the  two  given  sets: 

(a)  |2Z|  =  |4Z|; 

(b)  |N|<|Z|; 

(c)  |Z|<|N|; 

(d)  |N|<|R|. 

Problem  24.3.  Using  the  notation  of  the  proof  of  Theorem  24.6,  prove  that  A;,  «  Bh. 

Problem#  24.4.  Let  ,c/  be  a  nonempty  collection  of  sets.  We  define  the  following 
relation  on  ///.  For  A .  B  £  ,(//, 


A<B  if  |A|  <  |B|. 


(a)  Show  that  A  is  reflexive  and  transitive  on 

(b)  Show  that  for  A,B  £  sY ,  if  A<B  and  B  A  A,  then  A  k,  B. 

(c)  Is  ■<  a  partial  order  of  .t/7  If  it  is,  prove  it;  if  it  isn’t,  give  a  counterexample. 

Problem  24.5.  Assign  the  appropriate  integer  or  symbol  to  each  of  the  following 
cardinal  numbers,  put  them  in  increasing  order,  and  give  a  brief  explanation  for 
your  answer: 

|^(Q)|,  |^(0)|,  |^(^((0,1)))|,  | {0} | ,  |K\Q|,  |^(0))|, 

|{0,1}|,  |Q|,  |0|,  |(0, 1)|,  | [0, 1] | ,  m(0,l))|,  |^(N)),  |^({0,1})|. 

Problem*  24.6.  Let  X  and  Y  be  finite  sets  with  U  =  m,  K  =  n,  and  m  <  n.  Show 
that  according  to  Definition  24.2  we  have  |A|  <  L|. 

Problem  24.7.  Prove  the  following:  If  X  and  Y  are  sets  with  X  CY,  then  A  <  |F|. 
If  we  know  that  Ac  Y,  can  we  conclude  that  |A|  <  Y  \  ?  Prove  it  or  give  a  coun¬ 
terexample. 

Problem  24.8.  Prove  the  following:  If  A  and  Y  are  sets  with  |A|  <  \Y\,  then 
|^(A)|<|^(F)|. 

Problem  24.9.  In  each  of  the  following  parts,  give  an  example  of  such  functions  or 
explain  why  it  is  not  possible  to  do  so: 

(a)  Functions  /  and  g  such  that  /  :  Q  — >  N,  g  :  N  — >  Q,  both  are  one-to-one 
but  neither  of  them  is  onto; 
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(b)  Functions  /  and  g  such  that  / :  Q  — >  N,  g :  N  — » Q,  both  are  onto  but  neither 
of  them  is  one-to-one; 

(c)  A  function  /  :  R  — >  Q  that  is  one-to-one; 

(d)  A  function  /  :  ^(R)  — >  R.  that  is  one-to-one. 

Problem  24.10.  Let  A  =  {1,2, 3,4, . . .}  and  5={1,j,j,^,...}.  We  define  the  in¬ 
jective  functions 

f:A — >Bby  f(x)  =  — j—  and  g  :  B — >  Aby  g(x)  =  1  +5. 

x+3  x 

In  the  following  we  always  refer  to  the  notation  from  the  proof  of  Theorem  24.6  and 
the  sequences  that  we  defined  in  that  theorem. 

(a)  Find  the  sequence  that  starts  with  1/3. 

(b)  Find  the  sequence  that  starts  with  1/10. 

(c)  Consider  the  elements  2, 10,  and  15  in  A.  Find  the  corresponding  elements  in 
the  set  B  constructed  in  the  proof  of  Theorem  24.6. 

Problem  24.11.  Let  A  and  Y  be  nonempty  finite  sets  and  suppose  we  have  injections 
/  :  X  — »  Y  and  g  :  Y  — »  X  as  in  the  hypotheses  of  Theorem  24.6.  The  notation 
below  refers  to  that  of  the  proof  of  Theorem  24.6.  For  parts  (a)  and  (b)  your  answer 
will  involve  /  and  g. 

(a)  For  x  GX,  find  the  sequence  (x„). 

(b)  Fory  €  Y ,  find  the  sequence  (y„). 

(c)  Describe  the  sets  XaiXb,  and  X,„. 

(d)  Describe  the  sets  Ya.  Yf„  and  Yx.. 

Problem  24.12.  Since  |N|  <  \Z\  and  \Zj  <  |N|,  there  are  two  injective  functions 
/ :  N  — >  Z  and  g  :  Z  — >  N.  We  will  make  the  following  choices: 


f(x)  =  3x  and  g(x) 


3x  if  x  >  0 
— 3x+  1  if  x  <  0 


In  the  following  we  always  refer  to  the  notation  from  the  proof  of  Theorem  24.6. 

(a)  Find  each  of  the  five  sequences  that  start  with  the  following  elements  in  N: 
0,  1,45,  4(36),  and  5(36). 

(b)  Show  that  LL  =  {0}  and 

=  {(32s)k  :  k,s  £  N  and  k  =  1  or  k  =  3m  +  2,  m  S  N}. 


Problem  24.13.  For  an  alternate  proof  of  the  Cantor-Schroder-Bernstein  theorem 
construct  a  sequence  of  subsets  of  A  as  follows,  using  the  notation  from  the  theorem 
as  stated  in  the  text: 


Co=A\g(B),  Cj+1  =  g(f(Cj))  for  j  >  0. 


Set  C  =  U7=o  Cj  ancl  define  a  function  h  :  A  — >  B  by 
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h(x)  =  /  /W  ifxeC 

'  where  g(y)  =x  ifx  ^  C 

Finish  this  proof  by  showing  that  h  is  well-defined  and  that  it  is  a  bijection. 

Problem  24.14.  This  problem  outlines  yet  another  proof  of  the  Cantor-Schroder- 
Bernstein  theorem.  Using  the  notation  from  the  theorem  as  stated  in  the  text  we 
define  B'  =  /(A),  A'  =  g(B),  P  =  (go/) (A),  and  Q=A’\P. 

For  X  C  A  we  define  X*  =  (go  f)(X)UQ.  We  say  that  a  subset  X  of  A  is  normal 
ifX*  Cl 

(a)  Show  that  the  collection  of  normal  subsets  of  A  is  not  empty. 

(b)  Show  that  the  intersection  of  a  nonempty  collection  of  normal  subsets  is  a 
normal  subset. 

(c)  Show  that  if  X  is  a  normal  subset,  then  so  is  X*. 

(d)  We  let  N  denote  the  intersection  of  all  normal  subsets  of  A  and  define 
Y  =  P\(g  o  /)  (N) .  Prove  that  P  U  Q  =  Y  U  N. 

(e)  Prove  that  the  last  part  implies  that  |PU  Q\  =  |P|. 

(f)  Prove  the  theorem. 

Note:  This  proof  is  essentially  due  to  Peano  and  Zermelo  who  came  up  with  the 
same  idea  independently  and  published  it  independently  in  1906,  see  [34]. 

Problem  24.15.  The  set  (0, 1)  =  {x  £  JR.  :  0  <  x  <  1}  is  the  open  unit  interval.  Prove 
that  |(0, 1)|  =  |(0,1)  x  (0,1)1. 

(Cantor  proved  this  first  and  it  was  on  this  occasion  that  he  wrote  the  sentence 
quoted  on  page  25 1 .) 

Problem  24.16.  Prove  that  |R|  =  |C|,  where  C  =  {a  +  bi  :  a,b  £  1}  denotes  the 
complex  numbers.  (If  you  worked  Problem  24.15,  you  might  want  to  use  the  result 
here.) 

Problem  24.17.  We  denote  by  I  =  [0,1]  =  {x  G  R  :  0  <  x  <  1}  the  closed  interval 
from  0  to  1  and  by  &  (/)  the  set  of  all  functions  /  :  I  — >  I. 

(a)  Find  an  injective  function  g  :  &(I)  — *  PX(I  x  I).  (Prove  that  your  function 
is  well-defined  and  one-to-one.) 

(b)  Find  an  injective  function  h  :  PXii)  — >  ■'P'(J)-  (Consider  the  function  h  de¬ 
fined  as  follows:  Given  a  set  X  €  ^(7),  define  h(X)  =  %x  where  Xx  is  the 
characteristic  function  of  the  set  X.  Be  sure  to  prove  that  h  is  well-defined 
and  one-to-one.) 

(c)  Use  parts  (a)  and  (b)  together  with  the  result  of  Problem  24.15  to  prove  that 

|^(/)|  =  |^(M)|. 

Problem  24.18.  Let  X  be  a  nonempty  set.  We  denote  by  2X  the  set  of  all  functions 
/  :  X  — >  {0,1}-  Prove  that  \t?(X)\  =  |2X|.  (You  may  find  it  helpful  to  use  charac¬ 
teristic  functions  here.) 
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Problem  24.19.  Using  the  Cantor-Schroder-Bernstein  theorem,  give  an  easy  proof 
of  |(— 1, 1)1  =  |[-1, Ill- 

Problem  24.20.  A  generalization  of  the  continuum  hypothesis  says  that  for  any  in¬ 
finite  set  X,  there  is  no  set  Y  such  that  \X\  <  j Y \  <  \&(X)\.  Under  the  assumption  of 
this  generalized  continuum  hypothesis,  prove  that  a  set  X  with  at  least  two  elements 
is  infinite  if  and  only  if  there  is  no  set  Y  with  |A|  <  |F|  <  \£?(X)\. 

Problem  24.21.  See  Problem  24.20  above  for  a  statement  of  the  generalized  contin¬ 
uum  hypothesis.  Let  A  and  Y  be  sets  such  that  Y  is  countable  and  \^(X)\  <  \£Y>(Y)\. 
Assuming  the  generalized  continuum  hypothesis,  prove  that  |F|  ■ft  |A|. 


Chapter  25 

Metric  Spaces 


There  are  many  ways  to  measure  distance  in  the  spaces  in  which  we  live  and  work. 
For  example,  if  you  want  the  shortest  distance  between  two  geographical  places  (the 
distance  “as  the  crow  flies”),  you  follow  the  line  segment  joining  them.  But  in  real 
life  this  isn’t  always  possible.  If  you  are  driving  your  car  through  a  city  or  across 
your  campus,  you  need  to  go  around  solid  objects  and  not  through  them.  So  how  do 
we  calculate  distance  in  those  cases?  Measuring  distance  in  a  set  X  is  a  very  small 
(but  interesting)  part  of  a  branch  of  mathematics  known  as  “point  set  topology,”  and 
we  will  look  at  it  in  detail  in  this  chapter.  We  will  now  often  refer  to  the  elements  of 
X  as  points. 

So  let’s  go  back  to  the  first  time  you  measured  distance.  It  was  probably  in  R,  on 
a  number  line,  and  you  learned  that  the  distance  between  two  points  x  and  y  was  the 
absolute  value  of  the  difference  of  the  two  numbers.  If  we  write  d(x,y )  =  \x  —  y |, 
then  d  is  a  function  and  d  :  R  x  R  — >  R.  That’s  straightforward  enough,  but  now  we 
want  to  generalize  our  concept  of  distance.  So  let’s  turn  to  the  essential  properties 
of  a  distance  function. 

First,  distance  shouldn’t  be  negative,  so  d(x,y )  >  0  for  two  points  x  and  y,  and 
if  the  distance  satisfies  d(x,y)  =  0,  then  you  didn’t  move  anywhere,  so  x  =  y.  You 
also  surely  believe  that  distance  from  x  to  y  should  be  the  distance  from  y  to  x.  And 
finally,  in  Theorem  5.8  (Problem  5.14)  you  learned  the  triangle  inequality,  which 
said  that  “if  x  and  y  are  two  real  numbers,  then  |x  +  y|  <  |jc|  +  |y|.”  In  Problem  20.2, 
you  showed  how  to  switch  the  triangle  inequality  into  a  statement  about  distances. 
We  recall  the  result  of  that  problem  here:  For  real  numbers  x,y,  and  z, 

\x-y\  <  \x-z\  +  \z-y\. 

In  English,  this  means  that  our  path  will  be  shorter  if  we  go  directly  from  x  to 
y  as  opposed  to  taking  a  detour  through  z,  which  is  as  it  should  be.  So  we  would 
want  our  general  distance  function  to  satisfy  something  like  this  too;  that  is,  in  our 
new  “d”  notation  we  want  d(x,y)  <  d(x,z )  +d(z,y)  for  arbitrary  points  x.y,  and  z. 
So  now  we  will  define  something  that  acts  like  a  distance  on  an  arbitrary  set  X  and 
does  all  the  important  things  that  a  distance  should  do. 
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Let  X  be  a  nonempty  set.  Then  a  metric  on  X  is  a  function  d  :  X  x  X  — >  R 
satisfying  (i)-(iv)  below. 

(i)  (Nonnegativity)  For  all  x,y  £  X,  the  function  d  satisfies  d(x,y)  >  0. 

(ii)  (Definiteness)  For  all  x,y  £  X,  the  function  d  satisfies  d(x.y)  =  0  if  and  only 
if  x  =  y. 

(iii)  (Symmetry)  For  all  x,y  £  X,  the  function  d  satisfies  d(x,y)  =  d(y,x). 

(iv)  (Triangle  inequality)  For  all  x,y,z  £  X,  the  function  d  satisfies 


d(x,y )  <  d(x,z)+d(z,y). 


A  metric  is  also  called  a  distance  function.  A  nonempty  set  X  together  with  the 
metric  d  is  called  a  metric  space  and  is  denoted  by  (X,d),  or  just  X  when  it  is  clear 
which  distance  function  we  are  using. 

When  you  learn  the  definition,  don’t  forget  to  say  “LetX  be  a  nonempty  set.  Then 

a  metric  on  X  is  a  function  LXxX->K _ ”  These  sentences  tell  us  something 

about  d,  and  cannot  be  omitted. 

In  the  introduction,  we  showed  that  a  metric  can  be  defined  on  R  by  du(x,y) 
=  \x  —  y|.  Though  we  outlined  how  to  show  that  du  is  a  metric,  you  should  write  out 
the  details  to  complete  the  proof.  This  metric  is  often  called  the  usual  metric  (hence 
the  subscript  u)  or  the  Euclidean  metric  on  R,  and  it  is  the  one  upon  which  your 
intuition  is  almost  certainly  based.  A  set  can  have  lots  of  metrics.  The  next  example 
is  a  metric  on  R  that  is  not  the  same  as  the  metric  given  by  the  absolute  value. 

Example  25.1.  Define  a  metric  dj  :  R  x  R  — ■>  R  by 


We  will  show  that  dd  is  a  metric  on  R.  This  metric  is  called  the  discrete  metric,  and 
it  can  really  challenge  your  intuition. 

Proof.  It  is  clear  that  dd  is  a  function  from  IxR^t.  Now  let  x  and  y  be  points  of 
R.  We  begin  with  nonnegativity:  dd{x,y)  =  0  or  dj(x,y)  =  1,  so  clearly  dd{x,y)  >  0. 
Thus,  the  nonnegativity  condition  holds.  Furthermore,  since  dd(x,y)  =  0  if  and  only 
if  x  =  y,  the  definiteness  condition  holds.  For  symmetry,  note  that  if  x  /  y,  then 
yy^x  and  consequently  dd  (x,  y)  =  1  =  dd  (y,  x) .  If  x  =  y,  then  dd  (x,  y)  =  0  =  dd  (y,  x) , 
establishing  symmetry.  Finally,  we  establish  the  triangle  inequality.  To  this  end, 
note  that  if  z  is  a  point  of  R,  then  we  have  two  cases  to  consider.  In  the  first 
case,  if  x  =  y,  then  dd(x.y)  =  0  and  the  nonnegativity  condition  implies  that 
dd(x,y)  =  0  <  dd{x,z)  +  dd{z,y).  In  the  second  case,  xf^y,  which  implies  that  zy^x 
or  z.  f  y  (or  both).  Therefore,  either  dd(x,z )  =  1  or  dd{z,y)  =  1  (or  both).  Thus, 
dd(x,y)  =  1  <  dd(x,z)  +dd{z,y),  completing  the  proof  of  the  triangle  inequality. 


□ 


The  discrete  metric  can  be  defined  on  every  space:  the  distance  between  two 
distinct  points  is  one,  and  the  distance  from  a  point  to  itself  is  necessarily  zero.  The 
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proof  that  this  is  a  metric  on  a  set  X  is  indistinguishable  from  the  one  above.  Thus 
we  have  an  example  of  a  metric  on  R2.  Example  25.2  and  Exercise  25.3  provide  us 
with  some  other  metrics  on  R2. 

Example  25.2.  On  R2  define  a  metric  by 

du((xi,x2),(yi,y2))  =  \J (*i  -yi)2  +  (*2-y2)2- 


Using  Project  29.10,  it  can  be  shown  that  this  is  actually  a  metric  on  R2.  For  now, 
you  may  accept  this  fact.  This  metric  is  referred  to  as  the  usual  metric  or  the  Eu¬ 
clidean  metric  on  R2.  In  fact,  one  may  also  define  the  usual  metric  on  R"  by 


du((.xi,x2,...,x„),(yi,y2,...,yn))  =  C*/ “T/')2-  O 

Exercise  25.3.  We  now  have  two  examples  of  metrics  on  R  and  two  on  R2.  Here  are 
two  more  metrics  on  R2.  Before  you  begin  the  exercise,  familiarize  yourself  with 
the  metrics  by  computing  various  distances.  For  example,  try  to  find  the  distance 
from  the  point  (1,3)  to  the  points  (—3,4)  using  the  various  metrics  below. 

(a)  Show  that  dtc((x\  ,*2),  (yt,^))  =  |.ri  —  y\  \  +  \x2  —  3*2 1  is  a  metric  on  R2.  This 
metric,  dtc,  is  called  the  taxicab  metric.  Why  would  it  be  called  that? 

(b)  Show  that  dm((x\  ,*2),  (yi  ,^2))  =  max{|xi  — yi  |,  \x2  —  3^2 1 }  is  also  a  metric  on 

R2.  The  metric  dm  is  sometimes  called  the  max  metric.  O 

The  two  examples  introduced  in  Exercise  25.3  will  appear  again  in  the  near  fu¬ 
ture. 

A  metric  tells  us  when  points  are  close.  We  studied  the  notion  of  “closeness”  in 
Chapter  20  when  we  studied  convergent  sequences.  You  can  picture  convergence  of 
a  sequence  to  the  number  L  in  the  following  way:  a  sequence  converges  to  L  if  for 
every  e  >  0,  the  sequence  eventually  lies  in  the  open  interval  (L  —  e,L  +  e);  more 
precisely,  the  definition  of  convergence  said,  “There  exists  a  real  number  L  such  that 
for  every  e  >  0,  there  exists  a  real  number  N  such  that  \x„  —L\<e  for  all  n  >  N .” 
We  return  to  the  idea  of  finding  the  limit  of  a  sequence,  but  this  time  in  a  metric 
space.  So  given  a  sequence  (x„)  of  points  in  a  metric  space  ( X,d ),  then  (as  we  did 
before)  we  say  that  (x„)  converges  in  X  if  there  exists  a  point  x  £  X  such  that  for 
every  e  >  0,  there  exists  a  real  number  N  such  that  d{x„,x)  <  £  for  all  n  >  N.  As 
before,  in  the  event  that  such  an  x  exists,  it  is  also  unique.  (See  Theorem  25.8  below.) 
The  value  x  is  called  the  limit  of  the  sequence,  we  say  that  the  sequence  converges 
to  x,  and,  as  before,  we  write  xn  — >  x  or  \\m„_,rxixn  =  x.  If  the  sequence  does  not 
converge,  we  say  that  it  diverges.  If  we  consider  X  =  R  with  the  usual  metric,  this 
is  exactly  the  same  definition  that  we  had  in  Chapter  20.  Since  we  allow  all  sorts 
of  choices  for  X  now,  we  would  like  to  take  this  opportunity  to  point  out  that  the 


280 


25  Metric  Spaces 


point  x  must  be  in  the  space  X — not  in  some  larger  space  that  happens  to  contain 
X.  If  it  is  clear  that  x  belongs  to  X,  we  will  often  say  that  the  sequence  converges, 
rather  than  “the  sequence  converges  in  XT  Also,  note  that  as  the  metric  d  changes, 
the  distance  between  pairs  of  points  changes  as  well.  Therefore,  it  is  conceivable 
that  some  sequences  will  converge  in  one  metric,  but  not  in  another. 

Exercise  25.4.  Complete  the  sentences. 

(a)  Let  (x„)  be  a  sequence  in  a  metric  space  X  with  metric  d.  Let  x  £  X.  Then 
(x„ )  does  not  converge  to  x  if  ... . 

(b)  Let  (x„)  be  a  sequence  in  a  metric  space  (X,d).  Then  (x„)  does  not  converge 
if ... . 

We’ll  break  tradition  and  give  you  the  answer  to  part  (a)  of  the  above  exercise 
here,  because  we  need  it:  A  sequence  (x„)  does  not  converge  to  x  if  there  exists  an 
e  >  0  such  that  for  every  real  number  N,  there  exists  in  £  N  such  that  m>  N  and 
d(xmix)  >  e.  While  an  answer  to  (b)  might  read  “a  sequence  (x„)  does  not  converge 
if  for  every  x  £  X,  the  sequence  does  not  converge  to  x,”  this  will  probably  not  be 
the  most  useful  formulation  of  the  answer.  We  leave  the  more  useful  version  to  you. 

o 

Example  25.5.  We  know  that  l/«  — >  0  in  E  with  the  usual  metric.  Show  that 
(1  /«,  1  /n)  — ►  (0,0)  in  R2  with  the  usual  metric. 

Proof.  Let  e  >  0,  and  let  A  be  a  real  number  with  N  >  a/2 /e.  If  n  is  an  integer  with 
n  >  N,  then 

du((l/n,l/n),(0,0))  =  V(l/«-0)2  +  (l/n-0)2 
=  a/2 /n 

<  a/2 /N  (since  n  >  N ) 

<  a/2(e/a/2)  (as  N  >  y/l/e) 

=  £. 

See  Figure  25.1  for  a  graphical  illustration  of  this  convergent  sequence.  □ 

You  may  wonder  where  we  came  up  with  \f2/e.  We  did  it  by  understanding  the 
problem  and  devising  a  plan  by  working  backwards.  So  what  you  see  here  is  what 
happened  after  we  went  to  a  separate  sheet  of  paper,  and  started  with  the  inequality 

y/2/n  <  £. 

Example  25.6.  In  Chapter  20,  we  showed  that  the  sequence  (1/n)  converges  to  0  in 
R  with  the  usual  metric.  Does  (1/n)  converge  to  0  in  the  discrete  metric? 

We  claim  that  the  sequence  (in  R  with  the  discrete  metric)  does  not  converge  to 
0.  To  see  this,  let  £  =  1  /2.  For  every  N  €  R,  there  exists  an  integer  n  >  N.  Since 
1  /n  0,  we  know  that  dj  ( 1  /n ,  0)  =  1 .  Hence  for  £  =  1  /2,  and  for  every  N  £  R,  there 
exists  an  integer  n  >  N  such  that*,,  =  1/n  satisfies  dd(xn,0)  =  r4/(l  /n.0)  =  1  >  1/2. 
Thus  (1/n)  does  not  converge  to  0.  O 
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Fig.  25.1  (1/n,  1  /n) 
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In  the  discrete  metric,  every  point  is  “far”  from  every  other  point.  This  makes  it 
very  hard  to  converge. 

Exercise  25.7.  Consider  JR.  with  the  discrete  metric.  Describe  the  convergent  se¬ 
quences  in  this  metric  space.  O 

Sequences  have  many  important  properties,  some  of  which  we  discuss  in  the 
problems.  The  proofs  are  often  quite  similar  to  the  proofs  we  did  in  Chapter  20.  At 
this  point,  we  give  one  example  of  a  theorem  with  such  a  proof. 

Theorem  25.8.  If  a  sequence  (xn )  in  a  metric  space  (X  .cl)  converges,  then  the  limit 
is  unique. 

The  proof  of  this  is,  with  an  appropriate  change  in  notation,  the  same  as  the  proof 
of  Theorem  20.7. 


Definitions 

Definition  25.1.  Let  A  be  a  nonempty  set.  Then  a  metric  on  A  is  a  function 
rf:AxA->R  satisfying  (i)-(iv)  below. 

(i)  (Nonnegativity)  For  all  x,y  €  A,  the  function  d  satisfies  d(x,y)  >  0. 

(ii)  (Definiteness)  For  all  x,y  £  A,  the  function  d  satisfies  d(x.y)  =  0  if  and  only 
if  x  =  y. 

(iii)  (Symmetry)  For  all  x,y  £  A,  the  function  d  satisfies  d(x,y)  =  d(y,x). 

(iv)  (Triangle  inequality)  For  all  x,y,zG  A,  the  function  d  satisfies 

d(x,y)  <  d(x,z)+d(z,y). 

Definition  25.2.  A  nonempty  set  A  together  with  a  metric  d  is  called  a  metric  space 
and  is  denoted  by  (A  ,d)  or,  when  it  is  clear  which  distance  function  we  are  using, 
we  will  simply  refer  to  the  space  A. 
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Definition  25.3.  The  usual  metric  or  Euclidean  metric  on  the  reals  is  defined  by 

du(x,y)  =  \x  —  y\  forx,y  G  R. 

Definition  25.4.  The  discrete  metric  on  R  is  defined  by 


Definition  25.5.  The  usual  metric  or  Euclidean  metric  on  R"  for  n  G  Z+  is  defined 
by 


for  (xi,.. .  ,x„), (yi, . . . ,yn)  G  R". 

Definition  25.6.  The  taxicab  metric  on  M2  is  defined  by 

drC((xi,x2),(yi,y2))  =  |*t  -yi|  +  |*2-y2|  for  (xi,x2),(yi,y2)  g  R2. 

Definition  25.7.  The  max  metric  on  M2  is  defined  by 


dm  ( Oi ,  *2) ,  (y  1 ,  yi) )  =  rnax{  |xi  -  y  1 1 ,  |x2  -  y2 1 }  for  (xi ,  x2) ,  (y  1 ,  J2)  G  R2 . 


Definition  25.8.  A  sequence  (x„)  of  points  in  a  metric  space  (X .cl)  converges  in  X 
if  there  exists  a  point  x  G  X  such  that  for  every  e  >  0,  there  exists  a  real  number  N 
such  that  cl (x„  ■  x)  <  e  for  all  n  >  N.  The  value  x  is  called  the  limit  of  the  sequence 
(x„),  we  say  that  the  sequence  converges  to  x,  and  we  write  x„  — >  x  or  lim„_ioox„  =  x. 

Definition  25.9.  A  sequence  (x„)  diverges  in  a  metric  space  (X .cl)  if  it  does  not 
converge  in  (X,d). 

Definition  25.10  (for  Problem  25.7).  A  set  F  in  a  metric  space  (X  .d)  is  bounded 

if  there  exists  a  positive  real  number  M  such  that  d(x.y)  <  M  for  all  x,y  G  F. 

Definition  25.11  (for  Problem  25.8).  Let  (X .cl)  be  a  metric  space.  The  metric 
db  :  X  x  X  — >  R  defined  by 


db(x,y)  =  min{t/(x,y),  1} 


is  called  the  bounded  metric  associated  with  d  on  X. 


Solutions  to  Exercises 


Solution  (25.3).  Parts  (a)  and  (b)  are  very  similar,  so  we  will  work  part  (a)  only. 
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By  definition,  dtc  is  a  function  from  R  x  K  to  1.  Now  let  (xi,x2)  and  (yi,y2)  be 
elements  of  R2 .  We  will  first  show  the  nonnegativity.  Because  «  >  0  for  all  real 
numbers  a ,  we  know  that  affC((xi,x2),  (yi,y2))  =  |xi  — yi|  +  |x2  —  ^2 1  >  0,  showing 
that  nonnegativity  of  dtc  holds.  For  definiteness,  note  that  dtc{(x  1  ,X2),  (yi,y2))  =  0  if 
and  only  if  |xi  —y\ \  +  |x2  —  y2|  =  0.  This  last  equality  holds  if  and  only  if  |xi  —y\ |  =  0 
and  |x2  —  y2 1  =0.  This,  in  turn,  holds  if  and  only  if  xi  =  yi  and  X2  =  yi\  in  other 
words,  (xi,x2)  =  (yi,y2).  This  string  of  equivalences  establishes  the  definiteness  of 
dtc.  Symmetry  is  shown  as  follows: 

d,c{{x\,x2),{y\,y2))  =  |x  1  — yi  |  +  |x2  — y2| 

=  |yt  —xi  |  +  |y2  —  x2| 

=  d,c{{yi,y2),(xhx2)). 

To  prove  that  the  triangle  inequality  holds  for  dtc,  let  (z.  1  -  Zz)  R  R2  ■  Then 

^c((xi,x2),(yi,y2))  =  |xi  -  yi |  +  |x2  —  y2| 

<  ]xi  -zt|  +  |zt  —  yi  |  -F  |x2  —  Z2 1  -F  |^2  —  y2 1 

(by  the  triangle  inequality  in  R) 

=  (|xi-zi|  +  |x2-z2|)  +  (|zi-yi|  +  |z2->’2|) 

=  d,c{{x  1  ,x2) ,  (zi  ,z2))  +  dtciizi  ,z2) ,  (yi  ,y2))  • 

This  shows  that  dtc  is  a  metric  on  R2 .  The  taxicab  metric  between  two  points  mea¬ 
sures  the  distance  you  have  to  travel  from  one  point  to  the  next  in  a  city  built  with 
rectangular  blocks,  assuming  you  stay  on  the  streets,  do  not  take  detours,  and  do  not 
have  to  worry  about  one-way  streets. 

Solution  (25.4).  The  solution  to  part  (a)  was  given  earlier  following  the  exercise, 
so  here  is  the  solution  to  part  (b). 

The  sequence  (x„)  does  not  converge  in  the  metric  space  [X ,d)  if  for  every  x  G  X 
there  exists  a  real  number  e  >  0  such  that  for  every  real  number  N,  there  exists  m 
such  that  m  >  N  and  d (xm ,  x)  >  e. 

Solution  (25.7).  We  claim  that  a  sequence  (x„)  in  (R,.drj)  converges  if  and  only  if 
there  exist  real  numbers  x  and  M  such  that  xn  =  x  for  all  n  >  M.  (Such  a  sequence  is 
called  an  eventually  constant  sequence.) 

First  assume  that  (x„)  is  a  sequence  for  which  there  exist  real  numbers  x  and  M 
satisfying  xn  =  x  for  all  n  >  M.  We  will  show  that  xn  — >  x.  Let  e  >  0  and  let  N  =  M. 
Then  for  n  >  N  we  know  that  a^(xn,x)  =  r/^(x,x)  =  0  <  £,  which  shows  that  (xn) 
converges. 

For  the  converse,  assume  that  (x„)  converges.  Consider  e  =  1  /2.  Then  there  exists 
N  such  that  d,j(xnix)  <1/2  for  n  >  N.  By  the  definition  of  dj,  the  only  way  that  this 
can  happen  is  if  x„  =  x  for  n  >  N.  Taking  M  =  N,  we  have  shown  that  there  exist  x 
and  M  such  that  xn  =  x  for  all  n>M,  as  desired. 
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Problems 


Unless  otherwise  specified,  assume  that  you  are  working  in  a  general  metric  space 

(X,d). 

Problem  25.1.  (a)  Suppose  a  student  writes  the  following:  A  metric  is  a  function 

satisfying  (i)-(iv)  below. 

(i)  (Nonnegativity)  d(x,y )  >  0, 

(ii)  (Definiteness)  d(x,y)  =  0,  if  and  only  if  x  =  y, 

(iii)  (Symmetry)  d{x,y)  =  d(y,x),  and 

(iv)  (Triangle  inequality)  if  z  is  a  point  in  X,  then 

d(x,y)  <  d(x,z)+d(z,y). 

Write  this  student  a  letter  indicating  what  was  omitted  from  the  definition, 
what  must  be  inserted,  and  what  else  (if  anything)  needs  to  be  changed  to 
make  it  a  correct  definition. 

(b)  Suppose  the  student  had  exactly  the  same  definition  as  in  the  text,  except  for 
the  triangle  inequality,  where  the  student  has  “d(x,y)  <  d(x,z )  +  d(z,y)  for 
some  z  £  X .” 

Write  a  correct,  careful,  and  complete  response  to  this  student. 

Problem  25.2.  (a)  In  R,  find  the  distance  of  the  number  1  to  the  number  3  in  the 

usual  metric  and  in  the  discrete  metric. 

(b)  In  R2,  find  the  distance  of  the  point  (1,3)  to  the  point  (2,5)  in  the  usual 
metric,  the  taxicab  metric,  the  max  metric,  and  the  discrete  metric. 

Problem  25.3.  (a)  Sketch  the  set  {(x,y)  £  R2  :  du((x,y),  (0,0))  <  1},  where  du 

is  the  usual  metric. 

(b)  Sketch  the  set  {(x,y)  £  R2  :  dtc((x,y),  (0,0))  <  1},  where  dtc  is  the  taxicab 
metric. 

(c)  Sketch  the  set  {(x,y)  £  R2  :  dm((x,y),  (0,0))  <  1},  where  dm  is  the  max  met¬ 
ric. 

(d)  Sketch  the  set  {(x,y)  £  R2  :  ^/((x,y),  (0,0))  <  1},  where  dj  is  the  discrete 
metric. 

(e)  Sketch  the  set  {(x,y,z)  £  R3  :  du((x,y,z),  (0,0,0))  <  1},  where  du  is  the  usual 
metric. 

Problem  25.4.  (a)  We  defined  the  max  metric  on  R2.  Define  the  max  metric  on 

R"  and  prove  that  it  is  a  metric. 

(b)  We  defined  the  taxicab  metric  on  R2.  Define  the  taxicab  metric  on  R"  and 
prove  that  it  is  a  metric. 

Problem  25.5.  (a)  Show  that  (/:RxR-iR  defined  by 

</((xi,x2),(yi,y2))  =  |xi  — yi| 


is  not  a  metric  on  R2 
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(b)  Is  d  :  ffi.  x  K.  — >  K.  defined  by 

^((^i,x2),(yi,y2))  =  (*t -y\)2  +  {x2-y2)2 

a  metric  on  R2  ?  If  you  think  it  is  a  metric,  prove  it.  If  you  think  it  is  not  a 
metric,  find  points  in  R2  for  which  one  of  the  conditions  is  violated. 

Problem  25.6.  Let  (X,d)  be  a  metric  space.  Let  a  be  a  real  number  and  define  a 
new  function  da  on  X  x  X  by  da  (x.y)  =  U.d(x,y).  Is  d„  a  metric  on  XI  If  not,  what 
assumptions  must  be  placed  on  a  to  assure  that  da  is  a  metric?  Prove  your  answer. 

Problem  25.7.  A  set  F  in  a  metric  space  (X  .  d)  is  bounded  if  there  exists  a  positive 
real  number  M  such  that  d(x.y)  <  M  for  all  x,y  £  F. 

(a)  Consider  the  following  “not  a  definition”  of  a  bounded  set. 

“A  set  is  bounded  if  for  each  x,y  £  F  there  exists  a  positive  real  number  M 
such  that  d(x,y)  <  M.” 

Give  a  complete,  clear,  concise  explanation  of  the  problems  with  this  defini¬ 
tion. 

(b)  Give  an  example  of  a  metric  space  and  an  infinite  set  that  is  bounded  in  that 
metric.  Prove  that  it  is  bounded. 

(c)  Complete  the  following  definition:  Let  A  be  a  metric  space  with  metric  d  and 

let  F  be  a  subset  of  X.  Then  F  is  not  bounded  if _ 

(d)  Give  an  example  of  a  metric  space  and  a  set  that  is  not  bounded  in  that  metric. 
Prove  that  it  is  not  bounded. 

Problem  25.8.  Let  A  be  a  set  with  a  metric  d.  Define  a  function  db  :  X  x  A  — >  R  by 

db{x,y )  =min{d(x,y),l}. 


(a)  Show  that  db  is  a  metric  on  A.  This  metric  is  called  the  bounded  metric 
associated  with  d  on  A. 

(b)  (This  part  uses  Problem  25.7.)  Consider  the  metric  space  (A ,db).  Show  that 
in  this  space,  every  subset  of  A  is  bounded. 

Problem  25.9.  Let  (A  ,d)  be  a  metric  space  and  let  A  be  a  finite  subset  of  A.  Must  A 
be  bounded?  A  complete  answer  to  this  question  will  either  be  a  proof  that  the  set  A 
must  be  bounded,  or  an  explicit  example  of  a  metric  space  A  and  a  finite  unbounded 
subset  A.  Justify  all  assertions! 

Problem  25.10.  Show  that  in  a  metric  space  (A ,d)  the  metric  satisfies 

\d(x,z)-d(y,z)\  <d{x,y ), 


for  all  x,y,z  £  X. 
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Fig.  25.2  The  metric:  d(xi,yi)  for  i  =  1,2,3 


Problem  25.11.  Let  X  be  the  space  of  polynomials  with  real  coefficients.  Define  a 
function  cl  from  X  x  X  — >  K.  by  d(p,q)  =  |p(0)  —  <?(0)|.  Is  d  a  metric?  If  so,  prove 
it.  If  not,  why  not? 

Problem  25.12.  The  following  problem  is  appropriate  only  if  you  have  had  integra¬ 
tion  in  calculus. 

Let  X  be  the  space  of  real-valued  continuous  functions  defined  on  the  interval 
[0, 1],  Define  a  function  d  :  X  x  X  — >  R.  by 


for  all  f,g£X. 

(a)  Show  that  d  is  a  metric. 

(b)  Find  the  distance  between  ex  and  sin(7Tx/2). 

Problem  25.13.  This  problem  is  appropriate  only  if  you  have  had  integration  in 
calculus. 

Here  (X,d)  denotes  the  space  of  real-valued  continuous  functions  defined  on 
the  interval  [0,1]  with  the  metric  introduced  in  Problem  25.12.  For  each  n  £  Z+, 
we  define  a  function  fn  :  [0, 1]  — »  R.  by  f„(x)  =  xn.  Prove  that  the  sequence  (/„) 
converges  in  X  and  find  I i nv-><»  fn . 

Problem  25.14.  Choose  a  fixed  point  xo  in  R2.  If  du  denotes  the  usual  (or  Euclidean) 
metric  on  R2,  then  we  define  d  :  R2  x  R2  — >  R  by 


du(x,y)  if  x  and  y  are  on  a  straight 


d(x,y) 


line  through  xo 

du(x,xo)+du(xo,y)  otherwise 


Figure  25.2  illustrates  this  function  for  three  pairs  of  points  in  the  plane.  Prove  that 
d  is  a  metric  on  M2. 

This  metric  is  sometimes  called  the  “French  railway  system  metric”  or  the 
“SNCF  metric”  (Societe  Nationale  des  Chemins  de  fer  Fran5ais,  see  [49,  p.  56]). 
The  reason  for  this  is  the  following:  Think  of  xo  as  Paris,  and  you’ll  note  that  all 
trains  pass  through  Paris,  whether  they  need  to  or  not. 
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Problem  25.15.  Let  d  denote  the  “SNCF  metric”  on  R2  as  introduced  in  Prob¬ 
lem  25.14.  For  x\  £  R2  and  a  positive  real  number  r,  sketch  the  circle  C  =  {x  £ 
R2  :  d(x,x i)  =  r}.  You  may  find  it  helpful  to  consider  the  three  cases,  tf(xo,xi)  =  0, 
0  <  of(xo,xi)  <  r,  and  tf(xo,xi)  >  r,  separately. 

Problem  25.16.  Prove  each  of  the  following. 

(a)  Consider  R2  with  the  max  metric.  Prove  that  (1  /n,2/n)  — >  (0,0). 

(b)  Consider  R  with  the  usual  metric.  Prove  that  (— l)"n/(3«  +  1)  />  0. 

(c)  Consider  R2  with  the  max  metric.  Does  ((— l)",2/n))  converge  in  this  space? 

Problem  25.17.  Consider  Z  with  the  usual  metric. 

(a)  Show  that  a  sequence  that  is  eventually  constant  converges;  that  is,  if  there 
exist  integers  m  and  k  such  that  xn  =  k  for  all  n  >  m,  then  the  sequence  con¬ 
verges. 

(b)  Can  you  give  other  examples  of  convergent  sequences  in  (Z ,du)l  Explain 
your  answer. 

Problem  25.18.  Let  ( X,d )  be  a  metric  space,  and  let  (x„)  be  a  convergent  sequence 
in  X. 

(a)  Prove  that  there  exists  x  £  X  and  a  natural  number  K  such  that 

d(x„,x)  <  K  for  alln  £  N. 

(You  should  know  a  similar  problem.) 

(b)  Prove  that  the  set  {x„  :  n  £  N }  is  bounded;  that  is,  prove  that  there  exists  a 
positive  number  M  such  that  d(xn,xm)  <  M  for  all  n,m  £  N. 


Problem  25.19.  In  Problem  20.21  part  (c),  we  defined  the  term  Cauchy  sequence 
and  proved  some  facts  about  such  sequences.  This  problem  asks  you  to  do  the  same 
in  a  general  metric  space. 

(a)  Define  a  Cauchy  sequence  in  a  metric  space  (X .d). 

(b)  Prove  that  if  (x„)  converges  in  (X,d),  then  (x„)  is  Cauchy. 

Problem  25.20.  (This  problem  uses  Problem  25. 19.)  Let  X  =  R  \  Q  with  the  usual 
metric  d„.  Prove  that  the  sequence  (x„),  where  x„  =  y/2/n,  is  a  Cauchy  sequence  in 
X,  but  (x„)  does  not  converge  in  X . 


Problem  25.21.  Let  (X,d)  be  a  metric  space.  Define  a  new  function  de  :  X  x  X 
by 


de(x,y) 


d(x,y) 

1  +d(x,y) 


Show  that  de  is  also  a  metric  on  X. 


Chapter  26 

Getting  to  Know  Open  and  Closed  Sets 


When  we  work  in  R  with  the  usual  metric,  we  think  of  distance  as  measured  by 
absolute  value.  Points  are  close  when  the  absolute  value  of  the  difference  is  small. 
We  might  reasonably  argue  that  points  x  and  y  are  close  when  they  satisfy  |y  —  x|  <r, 
where  r  is  a  small  positive  number;  that  is  to  say,  y  is  in  the  open  interval  (x— r,x+r). 
This  interpretation  allows  us  to  visualize  the  distance  between  the  points.  As  it  turns 
out,  all  metrics  have  this  visual  interpretation. 

Let  x  be  a  point  in  a  metric  space  ( X,d )  and  let  r  be  a  real  number  with  r  >  0. 
Then  the  open  ball  of  radius  r  about  x  is  denoted  B,i  (x.  r)  and  is  defined  by 
Bd(x,r)  =  {y  L  X  :  d(y,x)  <r\.  We  will  call  B,j(x.  1 )  the  open  unit  ball  about 
x.  Note  that  the  radius  of  the  open  ball  Bc/(x,r)  is  always  positive,  and  Bj(x,r)  is 
centered  at  x.  This  is  quite  an  important  definition,  and  we  will  be  able  to  do  a  lot 
with  it.  But  remember,  before  you  work  an  example,  state  a  theorem,  or  write  a 
proof,  make  sure  that  you  and  your  intended  reader  are  clear  on  the  space  you  are 
working  on,  the  metric  you  are  using,  and  what  you  want  to  show. 

Example  26.1.  Consider  the  set  R  with  the  usual  metric.  What  does  Bd„{  1,1/2) 
mean?  Describe  Bdu  (x.  r),  for  an  arbitrary  x£l  and  r  >  0  as  follows:  Describe  the 
set  in  terms  of  open  intervals  and  sketch  the  set  on  a  number  line. 

By  definition,  Bd„  (1, 1/2)  =  {y  £  R  :  du(y,  1)  <  1/2}.  The  notation  is  preventing 
us  from  seeing  something  we  all  know  pretty  well,  so  let’s  get  rid  of  it.  Rewriting, 


0  1/2  1  3/2 


Fig.  26.1  Bdu{  1,1/2) 


U.  Daepp  and  P.  Gorkin,  Reading,  Writing,  and  Proving:  A  Closer  Look  at  Mathematics , 
Undergraduate  Texts  in  Mathematics,  DOI  10. 1007/978-1 -44 19-9479-0  26, 

©  Springer  Science+Business  Media,  LLC  20 1 1 


289 


290 


26  Getting  to  Know  Open  and  Closed  Sets 


Fig.  26.2  Bdu  (x.  r) 


Bdu(  1,1/2)  =  {y£M:|y-l|<  1/2} 

=  {ygK:-l/2<y-l  <  1/2} 

=  {y  £  M :  1/2  <  y  <  3/2} 

=  (1/2, 3/2). 

Figure  26.1  shows  Bdll(  1, 1/2)  graphically. 

The  solution  of  the  general  case  is  the  same  (see  also  Figure  26.2):  By  definition, 
Bdu  (x,  r)  =  {y  £  K.  :  du(y,x)  <  r}  =  {y  £  R  :  |y -x\  <  r]  =  {y  £  K :  -r  <  y -x  <  r} 
=  {y  £  R :  x  —  r<y<x+r}  =  (x—  r,x+r).  Therefore,  Bdn  (x,r)  =  (x—  r,x  +  r).  Q 

The  next  example  shows  that  the  open  balls  depend  on  the  underlying  set  X. 

Example  26.2.  Consider  the  set  X  =  [0, 1)  with  the  usual  metric.  Find  the  open  ball 

2*4  (1/4, 2/3). 

Since  the  center  is  x  =  1/4  and  the  radius  is  r  =  2/3,  we  find 


Bc;„(l/4,2/3)  =  {*£  [0,1)  :  |x-l/4|  <2/3} 

=  {x  £  [0, 1)  :  -2/3  <  x-  1/4  <  2/3} 

=  {x  £  [0,1)  :  — 5/12  <  x  <  11/12} 

=  [0,11/12).  q 

Now  it’s  your  turn. 

Exercise  26.3.  Consider  the  set  R2  with  the  usual  metric  (defined  in  Example  25.2). 
What  is  the  set  Bdu(( 0, 1 ) , 4) ?  Describe  the  set  using  precise  set  notation  and  sketch 
it.  "  O 

These  balls  can  be  used  to  describe  the  basic  structure  of  metric  spaces.  For 
example,  for  two  distinct  points  x  and  y  in  a  metric  space,  we  can  always  find  two 
disjoint  open  balls,  Bx  and  By,  such  that  x  £  Bx  and  y  £  By.  This  probably  agrees  with 
your  intuition.  On  the  other  hand,  as  we  shall  see,  there  exist  metric  spaces  in  which 
sets  consisting  of  a  single  point  are  open  balls!  In  the  remainder  of  this  chapter,  we 
will  see  how  these  balls  can  be  used  to  determine  which  sequences  converge.  But 
before  we  can  develop  the  connection  between  balls  and  convergence,  we  need  to 
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x  -  r  x  x  +  r 


2 


4 


Fig.  26.3  Bdll(x,r)  C  (2,4) 


look  at  some  of  the  important  sets  we  can  make  using  these  balls  together  with  the 
set  operations  we  studied  earlier. 

In  a  metric  space  (X,d),  a  subset  U  of  X  is  open  if  for  every  point  x  £  U,  there 
exists  an  open  ball  Bti(x,r)  satisfying  B,i (x,  r)  C  U .  Note  that  r  depends  on  x,  so 
that  as  x  changes,  r  will  too.  This  definition  is  very  visual,  so  pictures  will  help 
you.  A  word  of  caution  is  in  order  before  we  begin  our  examples:  Throw  away  any 
preconceived  notions  you  have  about  what  an  open  ball  should  look  like. 

Example  26.4.  Show  that  the  interval  (2,4)  is  open  in  M  with  the  usual  metric. 

You  may  be  thinking  that  there’s  nothing  to  show;  after  all,  it’s  an  open  interval. 
But  we  still  need  to  use  the  definition  to  check  that  this  interval  is  open  in  the  sense 
we  have  defined  here,  so  let’s  briefly  review  exactly  what  we  have  to  show. 

We  will  show  that  for  every  x  £  (2,4),  there  is  an  open  ball  Bdi  ,M  C  (2,4). 
We  know  our  metric  is  the  usual  one,  so  by  Example  26.1,  we  know  we  need  to 
show  that  there  exists  a  positive  number  r  with  (x  —  r,x  +  r)  C  (2,4).  To  find  r,  you 
should  draw  pictures  whenever  you  can.  Before  you  read  our  solution,  find  your  own 
following  this  outline:  first,  draw  and  label  an  appropriate  picture,  which  you  will 
then  use  as  you  continue  on  in  this  proof.  Next,  pick  an  arbitrary  point  x  £  (2,4). 
Now,  we  need  to  find  a  positive  number  r  with  Bliu(x,r)  =  (x—  r,x+  r)  C  (2,4). 
Look  at  your  picture  to  find  a  possible  value  of  r.  Then  show  that  it  works. 

Proof.  We  will  prove  that  (2,4)  is  open.  Letx  £  (2,4).  Then  2  <  x  <  4,  so  both  x  — 2 
and  4  — x  are  positive.  Let  r  =  min{x  — 2,4  — x}.  (See  Ligure  26.3.)  Then  r  >  0. 
Now  we’ll  check  that  Bdu(x,r)  C  (2,4).  So  let  y  £  B(iu{x,r).  Then,  by  definition, 
|y  —  x\  <  r.  Thus,  —r<y  —  x<r.  Lrom  the  upper  inequality  we  obtain  y  —  x  <  4  —  x, 
and  hence  y  <  4.  Lrom  the  lower  inequality  we  obtain  —  (x  —  2)  <  y  —  x,  and  hence 
2  <  y.  Thus,  y  £  (2,4),  and  we  conclude  that  Bdu(x,r)  C  (2,4).  □ 

A  few  questions  and  comments  are  in  order.  Lirst,  don’t  forget  to  check  that  r  >  0. 
An  open  ball  of  0  radius,  or  (worse  yet)  negative  radius,  is  no  ball  at  all.  Choosing 
r  as  the  minimum  of  finitely  many  real  numbers  is  a  pretty  standard  thing  to  do,  so 
it’s  a  good  idea  to  get  used  to  it  now.  And  finally,  there  are  lots  of  choices  for  r.  We 
picked  one  value  that  worked.  Every  smaller  positive  value  would  work  too. 

Exercise  26.5.  Let  (X,d)  be  a  metric  space  and  let  U  Cl.  Complete  the  sentence: 
The  set  U  is  not  open  in  (X,d)  if  ... . 

This  is  one  of  those  exercises  where  you  will  want  to  check  your  solution  against 
ours  before  you  go  on.  So  here’s  ours:  The  set  U  is  not  open  in  (X ,d)  if  there  exists 
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a  point  x  £  U  such  that  for  every  open  ball  Bd(x,  r)  about  x,  there  exists  a  point 
y€Bd(x,r)nUc.  O 

Exercise  26.6.  Show  that  the  interval  [2,4]  is  not  open  in  R  with  the  usual  metric. 

o 

Exercise  26.7.  The  setting:  R2  with  the  usual  metric.  Your  mission:  to  show  that  the 
open  unit  ball,  Bdii(( 0,0),  1),  about  (0,0)  is  open. 

Let  the  following  steps  guide  you: 

(a)  Sketch  the  open  ball  of  radius  1  about  the  point  (0, 0). 

(b)  Without  thinking  too  much  about  it,  choose  a  point  in  the  open  ball  (make 
sure  you  don’t  choose  (0,0)).  Make  a  dot  at  the  point,  and  label  it  ( a,b ). 

(c)  Draw  as  large  an  open  ball  as  you  can  that  is  still  contained  in  Bdu(( 0,0),  1) 
and  centered  at  your  dot  ( a,b ).  What’s  the  radius  of  that  open  ball?  (Here’s  a 
potentially  helpful  suggestion:  draw  the  radius  of  the  open  ball  Bdu(( 0,0),  1) 
that  passes  through  the  point  ( a,b ).) 

(d)  Now  you  are  ready  to  start  the  exercise.  Find  a  positive  real  number  r  such 
that  Bdu((a,b),r)  appears  to  be  contained  in  fi^((0,0),  1). 

(e)  Show  that  Bdu((a,b)  ,r)Q  Bdll{{  0,0),  1).  Write  out  the  whole  proof  carefully. 

Include  your  picture;  it’s  very  helpful  for  the  writer  and  the  reader.  O 

There  are  lots  of  interesting  sets  in  a  metric  space,  all  building  on  the  notion  of 
open  ball.  We  have  already  introduced  open  sets.  We  come  now  to  closed  sets.  A 
set  E  in  a  metric  space  X  is  closed  if  and  only  if  the  complement,  Ec,  is  open.  So, 
since  the  complement  of  an  arbitrary  open  set  is  a  closed  set,  we  can  immediately 
write  down  several  closed  sets.  For  example,  in  R  with  the  usual  metric,  the  set 
(—oo,2]  U  [4,oo)  must  be  closed,  since  its  complement  is  the  open  set  (2,4).  (See 
Example  26.4). 

It’s  important  to  know  many  ways  to  show  that  sets  are  open,  closed,  or  neither. 
Here’s  a  useful  result  that  should  have  a  one-line  proof. 

Theorem  26.8.  Let  (X  .cl)  be  a  metric  space.  A  subset  U  ofX  is  open  if  and  only  if 
its  complement  is  closed. 

Exercise  26.9.  Find  the  one-  (or  two-)  line  proof  of  Theorem  26.8.  O 

We  now  have  examples  of  open  sets  and  closed  sets.  But,  as  you  will  see  as  you 
work  the  next  two  exercises,  things  often  get  a  bit  complicated. 


Exercise  26.10.  Give  an  example  of  a  set  that  is  neither  open  nor  closed  in  R2  with 
the  usual  metric.  O 
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Exercise  26.11.  Let  (X,d)  be  a  metric  space.  Is  the  empty  set  open?  closed?  both? 
neither?  O 

You  might  have  found  yourself  concluding  that  if  a  set  is  not  open,  then  it  is 
closed.  This  is  normal,  because  in  ordinary  English  if  a  door  is  not  open,  then  it  is 
closed.  Unfortunately,  in  mathematics,  that’s  false!  In  the  two  exercises  above,  we 
have  seen  examples  of  sets  that  are  neither  open  nor  closed,  and  examples  of  sets 
that  are  both  open  and  closed.  Don’t  assume  anything  when  you  work  the  problems: 
if  we  didn’t  prove  it,  state  it,  or  use  it,  then  it  may  not  be  true. 

We  defined  an  open  ball  and  an  open  set.  You  will  show  (in  Problem  26.13) 
that  every  open  ball  is  an  open  set,  but  since  this  is  so  important,  we’ll  state  it  as  a 
theorem.  It’s  interesting  to  note  that  the  proof  is  very  much  like  the  proof  of  Exercise 
26.7. 

Theorem  26.12.  Let  (X .d)  be  a  metric  space.  For  every  point  x  £  X,  and  every 
positive  real  number  r,  the  set  Bd(x,r)  is  open. 

Now  we  will  get  to  see  some  ways  that  we  can  use  open  sets  and  some  more 
odd  properties  of  metric  spaces.  The  first  theorem  tightens  the  relationship  between 
open  sets  and  open  balls. 

Theorem  26.13.  Let  (X  .d)  be  a  metric  space.  A  set  U  is  open  if  and  only  if  there  is 
a  subset  I  ofX  and  a  set  of  radii  {ry  £  R+  :  y  £  1}  such  that  U  =  |Lg  i^diy,  ry)- 

There  are  some  things  that  might  be  confusing  to  you  in  this  statement,  but  it’s  much 
easier  to  see  what  it  means  to  be  an  open  set  if  you  understand  Theorem  26.13.  The 
index  set  is  a  way  of  saying  that  we  don’t  know  how  many  y  we  have;  there  could 
be  finitely  many  or  not,  countably  many  or  not,  and  this  way  we  don’t  have  to  deal 
with  that  issue.  Next,  the  ry  might  confuse  you.  Each  ball  has  a  (positive)  radius,  and 
if  we  wrote  Bcfy,r)  for  all  y,  we  would  be  saying  that  all  the  balls  have  the  same 
radius,  r.  That’s  not  what  the  theorem  says,  so  we  shouldn’t  say  that  either.  By  using 
the  notation  ry,  we  allow  each  y  in  I  to  have  its  own  radius,  ry.  Having  said  all  this, 
we  now  begin  the  proof. 

Proof.  Suppose  first  that  there  is  a  subset  I  of  X  such  that 


U  =  \jBd(y,ry). 

yet 

By  the  definition  of  open  set,  we  need  to  show  that  for  an  arbitrary  x£U,  there  exists 
an  open  ball  Bj(x,rx)  contained  in  U.  Now  if  x  £  U,  then  there  exists  an  element 
z£l  such  that  x  £  Bcfz ,  rz).  By  Theorem  26. 12,  the  ball  Brfz.  rz)  is  an  open  set,  and 
therefore  there  exists  a  positive  real  number  sx  such  that  Iffx.sf)  C  Iffy..  rz).  Since 
Bd{z,rz)  C  U,  we  know  that  Bd(x  isx)  £  U.  Hence  the  set  U  is  open. 

Now  suppose  that  U  is  open.  We  have  to  find  a  collection  of  open  balls  such  that 
U  is  the  union  of  those  open  balls.  By  the  definition  of  open  set,  if  x  £  U ,  there  exists 
an  open  ball  B(fx,  rx)  with  Bd{x,rx )  C  U.  Now  we  claim  that  U  =  \JxeuBcfx,rx).  If 
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we  establish  this  claim,  our  proof  will  be  complete.  To  see  that  U  is  contained  in  the 
union,  note  that  if  y  £  U,  then  y  £  Bj(y,  ry),  and  therefore  y  £  U xeV^d{x,rx).  Thus, 
U  C  Utgy Bci(x,rx).  To  show  that  U  contains  the  union,  note  that  Bj(x7rx)  C  U  for 
each  x.  From  this  it  is  easy  to  see1  that  U xeU^d{x  ,rx)  C  U,  completing  the  proof. 

□ 

Theorem  26.13  can  be  restated  as  follows:  A  set  U  in  a  metric  space  X  is  open  if 
and  only  if  U  is  a  union  of  open  balls. 

The  proofs  of  many  of  the  theorems  in  this  chapter  provide  an  excellent  opportu¬ 
nity  for  you  to  apply  all  the  techniques  that  you  have  learned  in  this  course.  For  this 
reason,  we  have  left  many  as  problems.  Here’s  another  useful  theorem. 

Theorem  26.14.  An  arbitrary  union  of  open  sets  is  open. 

The  proof  of  this  is  left  as  a  problem  (Problem  26.11)  for  you,  the  reader.  By 
“arbitrary  union”  we  mean  that  we  don’t  know  how  many  sets  we  have.  So  make 
sure  that  you  don’t  accidentally  assume  that  there  are  finitely  many  sets,  or  even 
countably  many. 

Theorem  26.15.  An  arbitrary  intersection  of  closed  sets  is  closed. 

The  proof  of  this  is  left  for  you  to  do  (Problem  26.12).  If  you  have  been  paying 
close  attention  to  the  theorems  and  definitions  presented  thus  far,  this  should  follow 
from  Theorem  26.14.  What  about  an  intersection  of  open  sets?  a  union  of  closed 
sets?  The  results  are  given  below  and  the  proofs  are  outlined  in  the  problems. 

Theorem  26.16.  Let  6) .... ,  Un  be  open  sets.  Then  D/=i  Uj  ,  v  cm  open  set. 

Theorem  26.17.  Let  F] ....  ,Fn  be  closed  sets.  Then  U7=i  Fj  a  closed  set. 

We’ll  conclude  this  chapter  with  the  metric  we  promised  would  challenge  your 
intuition. 

Example  26.18.  Consider  R  with  the  discrete  metric,  drj.  Prove  the  following. 

(a)  For  each  point  x  £  R,  the  set  {x}  is  an  open  ball. 

(b)  Every  set  in  (R,<^)  is  open. 

(c)  Every  set  in  (R,c4/)  is  closed. 

For  part  (a),  note  that  for  x  £  R,  the  set  {jc}  =  B(/d(x ,  1/2).  By  Theorem  26.12, 
the  set  Bjd  ( x ,  r)  is  an  open  set  and,  consequently,  {x}  is  open. 

For  part  (b),  let  S  be  a  subset  of  R.  Since  S  =  U«esM’  fr°m  part  (a)  we  see  that 
S  is  a  union  of  open  sets.  By  Theorem  26. 14,  S  is  open. 

For  part  (c),  let  T  be  a  subset  of  R.  Then  Tc  is  also  a  subset  of  R,  and  it  follows 
from  part  (b)  that  Tc  is  open.  But  a  set  is  closed  if  and  only  if  its  complement  is 
open,  and  therefore  T  is  closed.  O 

There’s  lots  more  that  we  can  do  here,  and  we  will  do  it  in  the  problems. 


1  If  this  isn’t  easy  to  see,  show  it  using  an  element-chasing  argument.  In  fact,  when  you  worked 
Exercise  8.10,  you  already  showed  it. 
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Definitions 

Definition  26.1.  Let  x  be  a  point  in  a  metric  space  (X,d)  and  let  r  be  a  real  number 
with  r  >  0.  Then  the  open  ball  of  radius  r  about  x  is  denoted  Bd  (x.  r)  and  is  defined 

by  Bd(x,r)  =  {y  £  X  :  d(y,x )  <  r}. 

Definition  26.2.  The  open  unit  ball  about  x  in  a  metric  space  (X,d)  is  the  open 
ball  Bd(x,  1). 

Definition  26.3.  In  a  metric  space  (X,d),  a  subset  U  of  A"  is  open  if  for  every  point 
x  £  U,  there  exists  an  open  ball  Bd(x,r)  satisfying  Bd(x  ,r)Q  U. 

Definition  26.4.  A  set  £  in  a  metric  space  X  is  closed  if  the  complement,  Ec,  is 
open. 

Definition  26.5  (for  Problem  26.18).  Let  (X,dx)  and  (Y,dy)  be  metric  spaces.  A 
function  / :  (X,dx)  — >  ( Y,  dY )  preserves  distances  if 


dy{f{x),f(x!))  =  dx{x,x!) 


for  all  x,x!  in  X. 

Definition  26.6  (for  Problems  26.19  and  26.20).  Let  £  be  a  subset  of  a  set  X  with 
metric  d.  A  point  x  is  said  to  be  an  interior  point  of  £  if  there  exists  an  open  ball 
Bd(x,  r)  with  Bd(x,  r )  C  £.  The  set  of  all  interior  points  is  called  the  interior  of  £ 
and  is  denoted  by  E°. 

Definition  26.7  (for  Problems  26.21  through  26.24).  Let  (X,d)  be  a  metric  space. 
Let  £  C  X.  A  point  x  £  X  is  a  limit  point  of  £  if  every  open  set  containing  x  contains 
a  point  y  £  E  with  y^x.  We  denote  the  set  of  all  limit  points  of  the  set  £  by  £;. 

Definition  26.8  (for  Problem  26.24).  Let  £  be  a  subset  of  a  metric  space  X.  The 
closure  of  E  is  denoted  by  £  and  is  defined  by  £  =  £  U  £/. 


Solutions  to  Exercises 

Solution  (26.3).  We  first  calculate  Bdu((0, 1) ,4).  The  graphical  representation  is 
shown  in  Figure  26.4. 


Bdum  1),4)  =  {(x,y)  £  R2  :  du((x,y),  (0, 1))  <  4} 


{(x,y)  £  R2  :  x2  +  (y-  l)2  <  16}. 
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Fig.  26.4  5f/„  ((0, 1) ,  4) 


Solution  (26.6).  We  will  use  the  solution  to  Exercise  26.5:  Choose  x  =  2  and  note 
that  2  £  [2,4].  Let  (2,  r)  be  an  arbitrary  open  ball  about  2.  Then  r  is  a  positive 
real  number.  We  claim  that  2  —  r/2  £  Bju(2,r)  (T  [2,4]c.  Since  du(2  —  r/2,2)  =  |(2  — 
r/2)  —  2|  =  r/2  <  r,  we  conclude  that  2  —  r/2  £  Bt/u(2,r).  Also,  since  2  —  r/2  <  2 
we  know  that  2  —  r/2  ^  [2,4].  This  establishes  the  claim  and  we  have  shown  that 
[2,4]  is  not  open  in  R  with  the  usual  metric. 

Solution  (26.7).  We  follow  the  outline  provided  beginning  with  the  illustration  in 
Figure  26.5. 

For  (a, b)  £  Bd„((0,0),  1),  let  r  =  1  —  V a2  +  b2.  Now  we  claim  that  Bju((a,b),r) 
is  an  open  ball  about  (a,b)  satisfying  Bau((a,b),r )  C  B^u{{ 0,0),  1). 

For  the  first  part  of  this  claim,  note  that  since  (a,b)  £  Bju(( 0,0),  1),  we  have 
du((a,b),( 0,0))  =  \/ a2  +  b2  <  1.  Hence  r  =  1  —  \/a2  +  b2  >  0  and  Bdu((a,b),r)  is 
an  open  ball  about  (a,/?). 

To  prove  the  set  inclusion,  let  (x,y)  £  Bju((a,b),r).  Then 

du((x,y),  (0,0))  <  du((x,y),(a,b))  +  du((a,b),( 0,0)) 

(by  the  triangle  inequality  for  the  metric  du) 

<  r+\/a2+b2  (since  (x,y)  £  Bdu((a,b),r)) 

=  1  —  \/a2  +  b2  +  \/ a2  +b2  =  1. 

Hence  (x,y)  £  Bju(( 0,0),  1).  This  establishes  the  second  part  of  the  claim. 

The  definition  of  an  open  set  implies  that  B^h(( 0,0),  1)  is  open. 

Solution  (26.9).  By  the  definition  of  closed  set,  the  set  Uc  is  closed  in  a  metric 
space  (X,d)  if  and  only  if  ( Uc)c  =  U  is  open  inX. 

Solution  (26.10).  The  set  A  =  {(x,y)  :  0  <  x  <  1}  is  neither  open  nor  closed  in  R2. 

To  show  that  A  is  not  open,  consider  (1,0)  £  A.  Let  r  £  R  with  r  >  0.  Then 
(l+r/2,0)  £^((1,0)/)  and  (l  +  r/2,0)  Since  r  was  an  arbitrary  positive 
real  number  we  conclude  that  for  each  r  £  R+  we  have  _Bt/l(((l,0),r)  <2  A.  Hence  A 
is  not  open. 
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Fig.  26.5  Sketch  to  find  r,  the  radius  of  the  open  ball  centered  at  (a,  b ) 


To  show  that  A  is  not  closed,  we  will  show  that  Ac  is  not  open.  To  this  end, 
consider  (0,0)  £  Ac  =  {(x,y)  £  R2  :  x  <  0  orx  >  1}.  We  may  assume  that  r  <2.  For 
every  r  £  R+  with  r  <  2  we  have  (r/2,0)  £  B(/u((0,0),r)  and  (r/ 2,0)  $  Ac.  Thus  Ac 
is  not  open  and  hence  A  is  not  closed. 

Solution  (26.11).  The  empty  set  is  open  in  every  metric  space  (X,d).  The  reason  is 
that  the  antecedent  “x  £  0”  is  always  false.  This  means  that  the  defining  implication 
is  always  true  for  0. 

The  whole  space  X  is  also  always  open:  For  all  x  £  X,  we  know  that  Brj (x.  r)  C  X 
for  every  positive  real  r.  Since  0C'  =  X  and  X  is  open,  the  definition  of  closed  implies 
that  0  is  closed. 

Thus  we  have  two  examples  of  sets  that  are  both  open  and  closed:  X  and  0. 


Problems 


We  continue  to  assume  that  X  is  a  metric  space  with  metric  d  unless  otherwise 
stated. 

Problem  26.1.  Show  that  the  set  (1,3)  U  (4,5)  is  open  in  R  with  the  usual  metric. 

Problem  26.2.  Consider  R  with  the  usual  metric.  In  each  case,  give  an  example  of 
a  nonempty  closed  set  C  and  a  nonempty  open  set  U  such  that 

(a)  U  fl  C  is  open. 

(b)  UUC  is  closed. 

(c)  U  fl  C  is  neither  open  nor  closed. 

(d)  UUC  is  neither  open  nor  closed. 

Problem  26.3.  Consider  the  set  A  =  {(x,y)  £  R2  :  —  1  <  x  <  0,  —  1  <  y  <  1}.  Prove 
that  A  is  open  in  R2  with  respect  to  the  max  metric,  dm.  Include  a  sketch  with  your 
proof. 

Problem  26.4.  Decide  whether  the  statements  below  are  true  or  false.  If  the  state¬ 
ment  is  true,  give  a  brief  reason  why.  If  the  statement  is  false,  give  a  counterexample. 
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(a)  In  R  with  the  usual  metric,  the  interval  [0,°°)  is  a  closed  set. 

(b)  In  R  with  the  discrete  metric,  the  interval  [0,°°)  is  an  open  set. 

(c)  A  finite  union  of  open  sets  is  an  open  set. 

(d)  An  arbitrary  union  of  closed  sets  is  a  closed  set. 

Problem  26.5.  Consider  the  set  M+  with  the  usual  metric. 

(a)  Show  that  the  set  (1,°°)  is  an  open  set  in  (R+,ri„). 

(b)  Show  that  the  set  (0, 1]  is  a  closed  set  in  (R+,riu). 

Problem  26.6.  Let  (X,d)  be  a  metric  space. 

(a)  Prove  that  for  all  x  and  y  in  X,  if  y  ^  x,  then  there  exists  an  open  ball  centered 
at  y,  say  Bd(y,  r),  such  that  x  <$_  Bd(y,  r). 

(b)  Prove  that  if  x  £  X,  then  {a}  is  a  closed  set. 

Problem  26.7.  Let  (X,d)  be  a  metric  space.  Let  (Uj)°°=1  be  a  sequence  of  open 
sets. 

(a)  Give  an  example  to  show  that  fl7=i  Uj  may  not  be  open. 

(b)  Is  it  ever  true  that  f)7=i  Uj  is  open? 

Problem  26.8.  Let  E  =  {(x,y)  £  R2  :  1  <x2  +y2  <  4}.  Prove  that  E  is  open  in  R2 
with  respect  to  the  usual  metric. 

Problem  26.9.  Complete  the  following  definition.  Let  A  be  a  metric  space  with  met¬ 
ric  d  and  let  Fbea  subset  of  X.  Then  F  is  not  closed  if _ 

Problem  26.10.  (a)  Give  three  examples  of  sets  that  are  both  open  and  closed  in 

R  with  the  discrete  metric. 

(b)  Give  two  examples  of  sets  that  are  both  open  and  closed  in  R  with  the  usual 
metric. 

(c)  Give  an  example  of  a  set  that  is  neither  open  nor  closed  in  M2  with  the  max 
metric.  Prove  that  it  is  neither  open  nor  closed! 

(d)  Give  an  example  of  a  set  that  is  closed  and  not  open  in  R2  with  the  usual 
metric.  Prove  that  it  is  closed  and  not  open! 

Problem  26.11.  Prove  Theorem  26.14.  (In  other  words,  show  that  if  {Oa  '■  d  £  1} 
is  a  collection  of  open  sets,  then  U aei^a  is  open.) 

Problem  26.12.  Prove  Theorem  26.15. 

Problem  26.13.  Prove  Theorem  26.12. 

Problem  26.14.  Let  (X,d)  be  a  metric  space,  x.y  £  X,  and  let  r\  and  r2  be  positive 
real  numbers  with  r\  <  r2.  In  what  follows,  do  not  use  Theorem  26.16. 

(a)  Show  that  Bd(x,n)  C  Bd(x,r2). 

(b)  From  the  previous  problem  we  know  that  every  open  ball  is  open.  Show  that 
if  Bd{x,r\)  and  Bd(y,r2)  are  open  balls,  then  Bd(x1r i)  nBd(y,r2)  is  an  open 
set.  Is  it  an  open  ball?  (Justify  your  answer  to  this  last  question,  please.) 
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Problem  26.15.  Prove  Theorem  26.16  by  completing  both  steps  below.  You  may 
find  it  very  helpful  to  work  Problem  26.14  first. 

(a)  Show  that  the  intersection  of  two  open  sets  is  an  open  set. 

(b)  Show  that  the  intersection  of  finitely  many  open  sets  is  an  open  set. 

Problem  26.16.  Prove  Theorem  26.17.  If  you  did  Problem  26.15,  you  might  con¬ 
sider  using  that  result  here. 

Problem  26.17.  (a)  Let  x  and  y  be  two  distinct  points  in  X  and  let  r  =  d(x,y) /2. 

Show  that  Bj{x1r)  and  Bci (y, r)  are  disjoint  sets. 

(b)  Show  that  for  two  distinct  points  x  and  y  in  a  metric  space,  there  exist  disjoint 
open  sets  6X  and  Gy  with  x  £  Gx  and  y  £  Gy. 

Problem  26.18.  Let  (X,dx)  and  ( Y. dy )  be  metric  spaces.  We  say  that  a  function 
/ :  (X,dx)  — >  (Y,dy)  preserves  distances  if  afr(/(x),/(x'))  =  dx{x,x!)  for  all  x,x' 

in  X. 

(a)  In  R  with  the  usual  metric,  the  function  /  :  (R,<4)  — >  (R ,d„)  defined  by 
f{x)  =  x  obviously  preserves  distances.  Give  an  example  of  another  function 
that  preserves  distances. 

(b)  Is  every  function  that  preserves  distances  one-to-one?  Either  prove  this  state¬ 
ment  or  give  a  counterexample. 

Problem  26.19.  Let  £  be  a  subset  of  a  set  X  with  metric  d.  A  point  x  is  said  to  be 
an  interior  point  of  E  if  there  exists  an  open  ball  £</(x,r)  with  B,j (x  ,r)CE.  The 
set  of  all  interior  points  is  called  the  interior  of  E  and  is  denoted  by  E° . 

(a)  In  R  with  the  usual  metric,  and  E  =  (2,4],  show  that  4  ^  E° .  Then  show  that 
(2,4  )=E°. 

(b)  In  M2  with  the  max  metric,  find  E°  if  E  =  {(x,y)  :  |x|  <  1}. 

Problem  26.20.  This  problem  is  only  appropriate  if  you  completed  Problem  26.19. 
Let  (X,d)  be  a  metric  space,  and  £  be  a  subset  of  X. 

(a)  By  the  definition  of  interior  point,  if  x  £  E°,  then  there  exists  an  open  ball 
Bj(x,  r)  centered  at  x  such  that  Bcfx  ,r)C£.  Show  that,  in  fact,  for  each  point 
x  £  E°,  there  exists  an  open  ball  Bj(x  ,rx)  C£°.  Use  this  to  prove  that  £°  is 
an  open  set. 

(b)  Prove  that  a  set  £  is  open  if  and  only  if  every  point  of  £  is  an  interior  point. 
Conclude  that  a  set  £  is  open  if  and  only  if  £  =  E°. 

Problem  26.21.  Let  (X,d)  be  a  metric  space.  Let  £  C  X.  A  point  x  £  X  is  a  limit 
point  of  £  if  every  open  set  containing  x  contains  a  point  y  £  E  with  y  /  x.  Let  £/ 
denote  the  set  of  all  limit  points  of  the  set  £. 

(a)  Complete  the  following  definition.  A  point  x  £  X  is  not  a  limit  point  of  £  if 
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(b)  What  are  the  limit  points  of  the  interval  (2,4]  in  R  with  the  usual  metric?  the 
discrete  metric? 

(c)  What  are  the  limit  points  of  £</„(( 0,0),  1)  in  R2  with  the  usual  metric?  the 
discrete  metric? 

(d)  What  are  the  limit  points  of  {l/n  :  n  £  Z+}  in  R  with  the  usual  metric?  the 
discrete  metric? 

Problem  26.22.  (This  problem  assumes  that  you  have  completed  Problem  26.21.) 
Let  (X ,d)  be  a  metric  space.  Let  £  be  a  subset  of  X.  Show  that  x  is  a  limit  point  of 
a  set  £  if  and  only  if  every  open  ball  Bli(x:r)  about  x  contains  a  point  y  £  E  with 
y±x. 

Problem  26.23.  (This  problem  assumes  that  you  have  completed  Problem  26.21.) 
Let  (X  ,d)  be  a  metric  space.  Let  £  be  a  subset  of  X. 

(a)  Prove  that  x  is  a  limit  point  of  a  set  £  if  and  only  if  every  open  set  about  x 
contains  infinitely  many  points  different  from  x. 

(b)  Prove  that  a  finite  set  has  no  limit  points. 

Problem  26.24.  (This  problem  assumes  that  you  have  completed  Problem  26.21.) 
Let  (X  ,d)  be  a  metric  space.  Let  £  be  a  subset  of  X. 

(a)  Show  that  £  is  closed  if  and  only  if  £  contains  all  its  limit  points.  In  other 

words,  prove  that  £  is  closed  if  and  only  if  £/  C  £. 

(b)  Let  £  be  a  set.  The  closure  of  E  is  denoted  by  £  and  is  defined  by  £  =  £  U£/. 

Show  that  if  x  is  a  limit  point  of  £,  then  x  is  a  limit  point  of  £. 

(c)  Show  that  if  x  is  a  limit  point  of  £,  then  x  £E  (note  that  you  did  the  hard  part 

in  (b)  above).  Conclude  that  £  is  closed. 


Chapter  27 

Modular  Arithmetic 


You  began  your  mathematical  education  adding,  subtracting,  multiplying,  and  divid¬ 
ing  integers.  From  there  you  moved  on  to  rational  numbers,  then  to  real  numbers, 
and,  perhaps,  to  complex  numbers.  But  this  is  not  the  only  kind  of  arithmetic  math¬ 
ematicians  study.  In  fact,  there’s  a  very  different  kind  of  arithmetic  that  you  use 
every  single  day.  The  popular  name  for  these  calculations  is  clock  arithmetic,  and  it 
is  indeed  based  upon  the  clock. 

Consider  the  following  scenario:  Suppose  it  is  now  3:00  P.M.,  and  you  start  on 
a  28-hour  trip.  What  time  will  it  be  when  you  return?  A  quick  calculation  yields  an 
answer  of  7:00  P.M.  How  did  you  arrive  at  this  answer?  You  did  something  we  all 
find  natural — you  did  clock  arithmetic.  We  will  build  carefully  upon  this  idea,  and 
we  will  apply  many  of  the  concepts  we  have  already  covered  to  help  us  understand 
it. 

Clock  arithmetic  isn’t  done  on  numbers,  but  rather  on  equivalence  classes  of 
numbers.  So  we  need  to  find  the  right  equivalence  relation.  Recall  that  for  two  inte¬ 
gers  a  and  b  with  a  /  0,  we  say  that  a  divides  b,  written  a  \  b,  if  there  is  an  integer  k 
such  that  b  =  ak.  Now  we  are  ready  for  the  equivalence  relation.  Let  n  £  Z  be  such 
that  n  >  1 .  Two  integers  x  and  y  will  be  related  if  n  (y  —  x) .  In  this  case  we  say  x  is 
congruent  to  y  modulo  n,  and  we  will  write  x  =  y  (mod  n). 

Exercise  27.1.  Find  five  different  solutions  to  each  of  the  problems  below,  and  then 
find  five  integers  that  are  not  solutions. 

(a)  Find  x  £  Z  such  that  5  =  x  (mod  12). 

(b)  Find  x  £  Z  such  that  x  >  1  and  —3  =  39  (mod  x).  O 

Theorem  27.2.  Let  n  >  1  be  an  integer.  The  relation  congruence  modulo  n  is  an 
equivalence  relation  on  Z. 

The  proof  of  this  theorem  is  left  as  Problem  27.3. 

For  an  integer  n  >  1,  the  set  of  all  equivalence  classes  with  respect  to  the  relation 
congruence  modulo  n  is  called  the  integers  modulo  n  and  denoted  by  Z„.  It  follows 
from  Theorem  11.4  that  Z„  is  a  partition  of  Z.  There  will  be  times  when  we  will 

U.  Daepp  and  P.  Gorkin,  Reading,  Writing,  and  Proving:  A  Closer  Look  at  Mathematics ,  301 

Undergraduate  Texts  in  Mathematics,  DOI  10. 1007/978-1 -44 19-9479-0  27, 

©  Springer  Science+Business  Media,  LLC  20 1 1 


302 


27  Modular  Arithmetic 


need  to  refer  to  the  elements  of  Z„,  and  since  these  are  equivalence  classes  and  not 
just  integers,  the  notation  must  be  chosen  carefully.  So  we  introduce  the  following: 
for  m  €  Z,  we  write  [m]n  =  {x  £  Z  :  n  \  (x  —  m) } . 

Note  that  we  now  have  two  ways  of  denoting  exactly  the  same  thing.  For  integers 
a,b,  and  n  with  n  >  1,  the  two  statements  “a  =  b  (mod  «)”  and  “[a]„  =  [/?]„”  are 
equivalent. 

Let  us  stop  and  think  about  what  this  all  means  in  the  context  of  time.  Suppose  we 
are  told  that,  “in  this  camp,  breakfast  is  served  at  7:00  A.M.”  What  does  this  mean? 
Is  there  exactly  one  instant  on  a  certain  day  and  time  at  which  breakfast  is  served? 
This  can  hardly  be  the  case,  since  we  eat  breakfast  every  day.  What  it  must  mean 
is  that  breakfast  is  served  at  7:00  A.M.  today,  tomorrow,  yesterday,  and  in  a  week. 
So  7:00  A.M.  actually  represents  many  different  times,  as  long  as  the  difference 
between  that  time  and  7:00  A.M.  is  a  multiple  of  24  hours.  Mathematically,  this 
idea  is  expressed  by  an  equivalence  class.  The  class  of  7  modulo  24  is  the  set  of  all 
integers  that  differ  from  7  by  a  multiple  of  24.  Thus, 

[7]24  =  [31]24  =  [ — 4 1]  24  =  •••  =  {••  ■  ,-41,-17,7,31,55, . . .}. 

The  “numbers”  in  modular  arithmetic  are  sets  of  numbers. 

Before  exploring  some  of  the  properties  of  the  integers  modulo  n,  we  need  to 
learn  a  bit  more  about  the  integers  themselves.  You  are  no  doubt  familiar  with  these 
properties  of  the  integers,  but  you  may  not  know  the  rigorous  definitions  or  the  ex¬ 
act  statements  of  the  theorems.  The  first  statement,  which  was  introduced  in  Prob¬ 
lem  13.22,  is  simply  about  division  of  one  integer  by  another. 

Theorem  27.3  (Division  algorithm).  Let  m  and  n  be  integers  with  n  ^  0.  Then  there 
exist  unique  integers  q  and  r  such  that  m  =  nq  +  r  and  0  <  r  <  \n\. 

In  plain  English,  the  division  algorithm  says  that  for  two  integers,  m  and  n,  we 
can  write  m  as  a  multiple  of  n  plus  what’s  left  over.  Of  course,  that’s  just  the  state¬ 
ment  that  q  is  the  quotient  and  r  the  remainder  when  we  divide  m  by  n.  You  might 
not  understand  why  we  need  to  prove  this — after  all,  you  have  been  using  it  for  a 
long  time.  But  did  you  ever  stop  to  think  about  what  it  really  means  and  why  it  is 
true?  In  fact,  if  you  worked  Problem  13.22,  then  you  already  proved  this  theorem 
following  the  outline  that  was  provided.  (The  statement  of  the  result  was  presented 
in  Theorem  13.6.)  If  you  have  not  already  done  so,  this  would  be  the  right  moment  to 
return  to  that  problem  and  the  outline  and  produce  a  proof  of  the  division  algorithm. 
We’ll  need  this  result  very  soon. 

Let’s  move  on  to  another  old  friend  from  the  past.  Given  two  numbers,  say  28 
and  42,  what  is  the  gcd  (or  greatest  common  divisor)  of  the  two  numbers?  You  can 
probably  figure  out,  without  too  much  trouble,  that  the  answer  is  14.  But  now  that 
you  have  much  more  mathematical  experience,  we  are  able  to  ask  (and  answer)  the 
more  complicated  questions  of  “how  can  we  give  a  precise  definition  of  gcd?”  and 
“is  there  an  algorithm  to  find  its  value?” 

Define  the  greatest  common  divisor  d  (which  we’ll  soon  see  is  unique)  of  two 
integers  m  and  n,  where  m  and  n  are  not  both  zero,  to  be  the  positive  integer  d  that 
satisfies 
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(i)  d\m  and  d\n,  and 

(ii)  if  s  is  a  positive  integer  such  that  s\m  and  s\n,  then  s \d. 

We  denote  the  greatest  common  divisor  of  m  and  n  by  gcd(m,n).  We  say  m  and  n 
are  relatively  prime  if  gcd(m,«)  =  1. 

Exercise  27.4.  (a)  What  does  condition  (i)  of  the  definition  of  the  greatest  com¬ 

mon  divisor  really  say? 

(b)  What  does  condition  (ii)  really  say? 

(c)  We  mentioned  that  the  gcd  of  two  numbers  is  unique.  How  would  you  try  to 

prove  this?  O 

Exercise  27.5.  Find  gcd(— 16,40),  gcd(0,45),  and  gcd(— 30,  — 27).  O 

The  next  theorem  tells  us  that  the  gcd  always  exists  and  is,  as  we  promised, 
unique. 

Theorem  27.6.  Let  m  and  n  be  integers,  not  both  zero.  Then  their  greatest  common 
divisor  exists,  is  unique,  and  there  are  integers  k  and  l  such  that  gcd (m,  n )  =  km+ln. 


This  theorem  actually  tells  us  more  than  the  existence  and  uniqueness  of  the 
greatest  common  divisor.  It  tells  us  that  the  gcd  can  be  expressed  as  the  sum  of 
multiples  of  the  two  numbers  m  and  n.  This  fact  is  certainly  not  obvious,  and  it  will 
turn  out  to  be  very  useful.  A  sum  of  the  form  km  +  In  where  k  and  /  are  integers 
is  called  a  linear  combination  of  m  and  n.  Theorem  27.6  is  usually  proved  by  first 
showing  that  gcd(m.n)  =  km  +  ln.  The  proof  looks  at  the  set  A  of  all  the  linear 
combinations  of  m  and  n  that  yield  a  positive  integer.  Since  A  will  be  a  nonempty 
set  of  positive  integers,  the  well-ordering  principle  tells  us  that  this  set  has  a  smallest 
element.  It  turns  out  that  this  element  will  satisfy  both  (i)  and  (ii)  in  the  definition 
of  greatest  common  divisor.  Once  we  have  shown  this,  we  will  still  need  to  present 
an  argument  that  there  is  no  other  integer  that  is  also  the  gcd  of  m  and  n. 

Proof.  For  m,n  £  Z  not  both  zero,  define  A  =  {xm  +  yn  :  x,y  £  Z  and  xm+yn  >  0}. 
First  we’ll  show  that  A^0.  We  know  that  m,n  £  Z,  and  so  we  may  set  x  =  m  and 
y  =  n.  Since  m  f  0  or  n  f  0,  we  conclude  that  xm  +  yn  =  nf  +  nf  >  0,  and  hence 
m2  +  n2  £  A.  Thus  A  f  (fi.  By  the  well-ordering  principle,  every  nonempty  set  of 
positive  integers  has  a  smallest  element,  and  we  call  this  element  d.  Since  d  £  A, 
there  exist  xo,yo  £  Z  such  that  d  =  xom+y^n.  We  will  show  that  d  =  gcd(m,n), 
proving  two  parts  of  the  theorem,  namely,  that  a  greatest  common  divisor  exists, 
and  that  this  divisor  can  be  written  in  the  form  km  +  In  for  some  k,l  £  Z. 

Since  d  £  A,  we  know  that  d  >  0.  By  the  division  algorithm  (Theorem  27.3),  we 
can  write  m  =  qd  +  r,  where  q  and  r  are  two  integers  with  0  <  r  <  d.  Therefore 
r  =  m  —  dq  =  m—  ( XQin  +  yon)q  =  (1  —  xo q)m+  (— yoq)n ,  where  1  —  xo q  and  —yoq 
are  integers.  Now  if  r  >  0,  then  r  would  be  an  element  of  A.  But  r  <  d  and  d  is  the 
smallest  element  of  A.  This  means  that  r  ^  A.  Hence  it  must  be  the  case  that  r  =  0; 
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in  other  words,  d\m.  Exactly  the  same  argument  shows  that  d\n.  Thus  d\m  and  d\n, 
and  (i)  in  the  definition  of  gcd  holds. 

Suppose  that  s  is  a  positive  integer  such  that  s\m  and  s\n.  Since  d  =  XQtn+yon, 
we  conclude  that  s|d.  (You  are  asked  to  write  out  the  details  of  this  last  step  in 
Problem  27.1.)  Hence  (ii)  also  holds  for  d.  We  now  know  that  a  greatest  common 
divisor  exists  and  has  the  right  form.  It  remains  to  show  that  it  is  unique. 

So  suppose  that  d  and  t  are  both  greatest  common  divisors  of  m  and  n.  Then, 
since  d  is  a  gcd,  property  (ii)  of  the  definition  says  that  t\d.  On  the  other  hand,  t  is  a 
gcd,  so  d\t.  We  conclude  that  t\d  and  d\t.  Since  both  t  and  d  are  positive  integers  it 
follows  (see  Problem  27.2)  that  t  =  d,  completing  the  proof  of  uniqueness.  □ 

Incidentally,  while  the  greatest  common  divisor  d  is  unique,  the  integers  xo  and 
yo,  as  defined  in  the  proof,  are  not.  Here  is  a  simple  example:  Consider  the  integers 
m  =  6  and  n  =  9.  Then 

gcd(6,9)  =  3  =  2-6  + (—1) -  9=  (— 1)  -6  +  1  •  9. 

Unfortunately,  the  proof  of  Theorem  27.6  was  not  constructive;  that  is,  it’s  a  nice 
enough  proof,  but  it  doesn’t  really  tell  us  how  to  find  gcd(m,  n) .  However,  there  is  an 
algorithm  to  do  just  that — one  that  appeared  in  Euclid’s  Elements  over  2,300  years 
ago.  The  algorithm  is  appropriately  called  the  Euclidean  algorithm  and  you  will 
learn  to  apply  it  in  Problem  27.20  to  calculate  the  gcd  of  two  integers. 

We  now  return  to  modular  arithmetic.  To  get  you  back  into  the  proper  state  of 
mind,  we  suggest  that  you  reread  (in  the  beginning  of  this  chapter)  what  it  means 
for  two  integers  to  be  equivalent  modulo  n,  where  n  >  1.  Then  work  the  following 
exercise: 

Exercise  21.1.  Show  that  for  integers  m  and  n  with  n  >  1,  there  exists  an  integer  r 
satisfying  0  <  r  <  n  such  that  m  =  r  (mod  n).  Q 

One  good  thing  about  the  integers  is  that  we  can  perform  basic  algebraic  ma¬ 
nipulations  on  them,  like  adding,  subtracting,  and  multiplying.  Can  we  do  this  on 
Z„  also?  The  answer  is  yes,  but  we  must  first  carefully  define  how  these  operations 
work  on  the  equivalence  classes  that  make  up  the  set  Z„.  That’s  what  we  will  do 
right  after  we  work  an  example  to  remind  you  what  it  means  for  two  equivalence 
classes  modulo  n  to  be  the  same. 

Example  27.8.  For  integers  r.s.  and  n  with  n  >  1,  prove  that  \r]„  =  [,v] „  if  and  only 
if  there  exists  k  £  Z  such  that  r  —  s  =  kn. 

Proof.  By  Problem  10.10,  I)'],,  =  [.v] „  if  and  only  if  r  ~  s.  Thus  [r]„  =  [s]„  if  and  only 
if  r  =  s  (mod  n).  Hence  [r]„  =  [s]„  if  and  only  if  there  exists  an  integer  k  such  that 
r  —  s  =  kn.  □ 

Be  sure  to  keep  this  fact  in  mind  as  you  read  on  in  the  text,  and  especially  as  you 
work  your  way  through  Example  27.9,  in  which  we  will  show  that  multiplication  on 
Z„,  as  introduced  below,  is  well-defined. 
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Fix  an  integer  n  >  1 .  Now  Z„  is  closely  related  to  Z,  so  we  will  try  to  modify  the 
operations  of  Z  so  that  they  apply  to  Z„.  For  r.s  £  Z,  define 

M»  +  [s]n  =  [''+i]n,  M„-[.s]„=  [r-s]„,  and  [r]n  •  [s]„  =  [rs)n. 

Before  going  on,  convince  yourself  that 

[12]s  +  [7]s  =  [4]s,  [12]s  -  [7] 5  =  [0]5,  and  [12]5  •  [7]s  =  [4]s. 

These  definitions  amount  to  defining  three  functions  from  Z„  x  Z„  to  Z„,  and 
we  need  to  show  that  they  are  well-defined.  We  will  provide  a  complete  proof  for 
the  operation  of  multiplication,  and  we  will  leave  addition  and  subtraction  to  you  in 
Problem  27.11. 

Example  27.9.  Define  /  :  Z„  x  Z„  — >  Z„,  by  f([r]n,  [s]„)  =  [rs]n.  Then  /  is  a  well- 
defined  function  that  yields  a  multiplication  on  Z„. 

“Understanding  the  problem.”  To  show  that  a  function  is  well-defined,  we  need 
to  prove  two  things,  (i)  We  must  show  that  /  maps  Z„  x  Z„  into  Z„.  In  other  words, 
we  must  show  that  for  every  element  x  in  Z„  x  Z„,  there  exists  an  element  y  in  Z„ 
such  that  /(a:)  =  y.  It  is  important  to  recall  that  when  an  element  of  a  set  can  be 
written  in  a  special  form,  as  is  the  case  with  x  in  Z„  x  Z„,  we  should  take  advantage 
of  it.  So  if  x  is  in  Z„  x  Z„,  then  there  exist  integers  r  and  s  such  that  x  =  ([r]„,  [j]„). 

(ii)  We  must  also  show  that,  for  x  in  Z„  x  Z„,  if  f(x)  =  y  and  f{x)  =  z,  then  y  =  z. 
Again,  we  will  expect  to  use  the  special  form  of  x,y,  and  z.  Now  x  can  be  written  as 
([/•],,,  [s]„)  for  some  integers  r  and  s.  Our  function  /  is  supposed  to  look  at  the  pair 
of  equivalence  classes,  choose  an  integer  from  each  (they  could  be  r  and  ,v,  but  don’t 
have  to  be),  multiply  these  together,  and  produce  the  resulting  equivalence  class. 

What  could  possibly  go  wrong?  Let  us  look  at  an  example.  We  know  that 
[7]6  =  [19]6,  because  6|  ( 19  —  7).  By  the  definition  of  multiplication  in  Zg,  we  write 
[7]g  •  [4] 6  =  [28]6  and  [19]g  •  [4]g  =  [76]g.  The  left  sides  of  both  equations  are  the 
same,  so  the  right  sides  had  better  be  the  same  as  well,  or  we  have  a  fatal  prob¬ 
lem  on  our  hands.  Are  they  the  same?  Our  proof  will  need  to  show  that  the  result 
of  the  multiplication  operation  is  independent  of  the  particular  integers  we  used  to 
represent  the  equivalence  classes. 

“Devising  a  plan.”  To  prove  part  (i),  we  have  to  show  that  /  is  defined  for  every 
element  of  Z„  x  Z„,  and  yields  an  element  in  Z„.  For  part  (ii),  we  let  x  £  Z„  x  Z„ 
and  suppose  that  f(x)  =  y  and  f(x)  =  z.  We  must  show  that  y  =  z.  Now,  we  can 
assume  that  there  exist  integers  r,s,u,  and  v  such  that  x  =  ([r]„,  [s]„)  =  ([«]„,  [v]„) 
and  y  =  /([r]„,  [s]„)  =  [ri]„,  while  z  =  f([u]„,  [v]„)  =  [uv\„.  Therefore  we  must  show 
that  [rs]n  =  [i uv\n .  By  Example  27.8,  we  know  that  this  means  that  we  must  show 
that  there  exists  an  integer  m  such  that  rs  —  uv  =  mn.  How  can  we  show  that  such 
an  integer  m  exists?  By  using  what  we  know,  namely,  that  ([r]„,  [.?]„)  =  ([«]„,  [v]„). 
Looks  like  we  are  now  ready  to  carry  out  our  plan. 

Proof.  Let  x  £  Z„  x  Z„.  Then  x  =  ([r]„,  [s]„)  for  some  r,s  £  Z.  Hence,  rs  £  Z,  and 
therefore  [rs]n  £  Z„.  By  the  definition  of  /,  we  have  f(x)  =  [rs],,.  Thus,  /  maps 

Z„  x  Z„  to  Z„ . 
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Again  let  x  £  Z„  x  Z„.  We  consider  two  arbitrary  representations  of  x,  say 
x=  ([/-]„,  [s]„)  and  x=  ([«]„, [v]„).  Then  f(x)  =  [rs]„  and  f(x)  =  [uv\n.  We  need 
to  show  that  [rs]„  =  [uv]„.  Since  ([r], ,,[$]„)  =  ([m]„,[v]„),  the  definition  of  ordered 
pair  implies  that  [r]„  =  [u\n  and  [s]„  =  [v]„.  Thus 

u  —  r  =  kn,  for  some  k  £  Z,  and  (27. 1) 

v  —  s  =  In,  for  some  /  £  Z.  (27.2) 

To  show  that  [rs\n  =  \uv\n,  we  calculate 

uv  —  rs  =  uv  —  rv  +  rv  —  rs 
=  (w  —  r)v+  r(v  —  s) 

=  knv  +  rln  (using  equations  (27.1)  and  (27.2)) 

=  (kv  +  rl)n. 

Now  kv  +  rl  £  Z  and,  by  Example  27.8,  [rs]„  =  [uv]„.  Thus  /  is  well-defined,  as 
desired.  □ 

Exercise  27.10.  Define  “modular  exponentiation”  as  follows:  For  an  integer  n  >  1 
and  a,b  £  Z,  define  [a] =  [ab]„.  Either  prove  that  this  operation  is  well-defined, 
or  give  an  example  to  show  that  modular  exponentiation  is  not  well-defined.  O 

In  Z„,  the  operations  of  multiplication  and  addition  are  commutative  and  asso¬ 
ciative.  The  set  Z„  has  an  additive  identity;  that  is,  there  is  an  element  0  £  Z„  such 
that  x+  0  =  x  for  all  x  £  Z„  (namely,  0  =  [0]„).  Similarly,  Z„  has  a  multiplicative 
identity;  that  is,  there  is  an  element  e  £  Z„  such  that  x  ■  e  =  x  for  all  x  £  Z„  (namely, 
e  =  [1]„).  Further,  every  element  of  Z„  has  an  additive  inverse;  that  is,  for  every 
x€Z„  there  is  yx  £  Z„  such  that  x+yx  =  [0]„  (if  x  =  \r\n,  then  — x  =  [— r]„).  Multi¬ 
plication  is  also  distributive  over  addition.  (See  Problem  27.14.)  Thus,  they  satisfy 
everything  you  might  reasonably  hope  an  operation  would  satisfy  except  for  one 
thing:  not  every  nonzero  element  has  a  multiplicative  inverse.  The  following  imme¬ 
diate  consequence  of  Theorem  27.6  tells  us  something  about  reciprocals  in  Z„. 

Corollary  27.11.  Let  n  be  a  positive  integer  with  n  >  1.  Then  for  every  integer  a 
with  gcd (a,n)  =  1,  there  exists  an  integer  b  such  that  ab  =  1  (mod  n). 

Before  proceeding  to  the  proof  of  the  corollary,  write  out  what  it  means  to  say 
that  ab  =  1  (mod  h). 

Proof.  Since  gcd(a./ij  =  1,  Theorem  27.6  tells  us  that  there  exist  b,c  £  Z  such  that 
ba  +  cn=  1.  Then  ba—  1  =  (— c)n  and  —  c  £  Z.  Thus,  ab  =  1  (mod  n ).  □ 

For  an  integer  a  to  satisfy  the  hypothesis  of  this  corollary,  a  needs  to  be  rela¬ 
tively  prime  to  the  modulus  n.  Is  it  possible  that  we  have  two  integers,  a  and  b  with 
a=  b  (mod  n),  such  that  one  of  the  integers,  say  a,  satisfies  the  hypothesis  and  the 
other  one,  b,  does  not?  The  answer  to  this  query  is  no,  as  we  see  from  the  following 
lemma: 
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Lemma  27.12.  Let  a,c,  and  n  be  integers  with  n  >  1  and  such  that  a  =  c  (mod  n). 
Then  gcd(a,n)  =  1  if  and  only  i/gcd(c,«)  =  1.  Further,  ifb  and  d  are  integers  such 
that  ab=\  (mod  n)  and  cd  =  1  (mod  n),  then  b  =  d  (mod  n). 

The  proof  of  this  lemma  requires  the  multiplication  defined  on  Z„  earlier  in  this 
chapter,  and  the  (easily  checked)  algebraic  properties  of  this  multiplication.  (See 
Problem  27.14.) 

Proof.  For  the  first  part  of  the  proof,  we  prove  the  contrapositive;  that  is,  we  prove 
that  if  gcd(c,«)  ^  1,  then  gcA{a,n)  ^  1.  So  assume  that  gcd(c,n)  =  k  >  1.  Since 
a  =  c  (mod  «),  we  conclude  that  a  —  c  =  In  for  some  l  £  Z.  Hence  a  =  c  +  ln.  But 
k\c  and  k\n,  so  k\a.  By  (ii)  in  the  definition  of  gcd,  we  conclude  that  k|gcd(a,n). 
Thus  gcd (a,n)  >  1.  The  converse  is  obtained  by  interchanging  the  roles  of  a  and  c. 

For  the  second  part  of  the  proof,  we  use  our  assumptions:  [«]„[£>]„  =  [ab]„  =  [1]„, 
[c]„[d\n  =  [ cd\n  =  [l]n,  and  [a\n  =  [c\n.  Thus,  we  calculate 

[b\n  =  [ bcd]n  =  [bad],,  =  [ abd]n  =  [d\n, 

and  we  conclude  that  b  =  d  (mod  n).  □ 

Taken  together,  the  corollary  and  the  lemma  tell  us  that  if  an  integer  a  is  relatively 
prime  to  n,  then  there  exists  an  integer  b  such  that  the  equivalence  classes  satisfy 
[a\n  •  \b\n  =  1.  So,  for  a  relatively  prime  to  n,  the  equivalence  class  has  something 
that  should  remind  you  of  a  reciprocal.  This  leads  to  the  following  definition:  For 
a,b ,  and  n£  Z  with  n  >  1,  we  call  h  a  reciprocal  modulo  n  of  a  if  ab  =  1  (mod  n). 
The  notation  is  b  =  a~l  (mod  n). 

Exercise  27.13.  (a)  Find  the  reciprocals  modulo  7  of  3,5,  and  6. 

(b)  Which  elements  of  Zg  have  reciprocals  modulo  6  and  which  do  not?  O 

The  use  of  modular  arithmetic  is  widespread.  Every  time  you  are  on  the  Web, 
your  browser  is  likely  to  make  your  transactions  secure  using  an  encryption  that  is 
based  on  modular  arithmetic.  (Work  Project  29.13  on  codes  to  see  one  such  use.) 
We  motivated  the  ideas  in  the  chapter  using  time  and  calculations  modulo  24.  If  you 
schedule  tasks  by  days  of  the  week  you  probably  want  to  calculate  with  modulus 
7;  if  you  are  interested  in  a  monthly  schedule,  the  modulus  is  12.  In  fact,  now  that 
we’ve  mentioned  it,  you  can  surely  think  of  many  other  times  when  you  have  used 
modular  arithmetic. 


Definitions 

Definition  27.1.  Let  n  £  Z  with  n  >  1.  Integers  x  and  y  will  be  related  if  n\(y  —  x). 
In  this  case,  we  say  that  x  is  congruent  to  y  modulo  n,  and  we  write  x  =  y  (mod  n). 
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Definition  27.2.  For  an  integer  n  >  1,  the  set  of  all  equivalence  classes  with  respect 
to  the  relation  congruence  modulo  n  is  called  the  integers  modulo  n  and  denoted 

by  Z„. 

Definition  27.3.  The  greatest  common  divisor  of  two  integers  m  and  n,  where  m 
and  n  are  not  both  zero,  is  the  positive  integer  d  that  satisfies 

(i)  d\m  and  d\n,  and 

(ii)  if  s  is  a  positive  integer  such  that  s\m  and  s\n,  then  s\d. 

It  is  denoted  by  gcd (m,n). 

Definition  27.4.  Two  integers  m  and  n  are  relatively  prime  if  gcd  (m,n)  =  1. 

Definition  27.5.  For  a,b,  and  ngZ  with  n>  1,  we  call  b  a  reciprocal  modulo  n  of 

a  if  ab  =  1  (mod  n).  The  notation  is  b  =  aTx  (mod  n). 


Solutions  to  Exercises 

Solution  (27.1). 

(a)  We  defined  5  =  x  (mod  12)  by  12  |  (x  — 5).  Some  possible  values  forx  are: 
5, 17, 125,  —7,-115.  Some  values  that  do  not  work  are:  0,7, 1200,  —5,  —  12. 

(b)  The  equivalence  —3  =  39  (mod  x)  is  defined  by  x|42  where  x  is  an  integer 
greater  than  1.  The  set  of  all  positive  factors  greater  than  1  of  42  is  the  set 
A  =  {2,3, 6, 7, 14,21,42}.  Any  five  integers  from  A  will  work.  The  five  non¬ 
solutions  must  be  chosen  from  the  integers  greater  than  1  that  are  not  in  A. 

Solution  (27.4). 

(a)  Condition  (i)  says  that  the  greatest  common  divisor  divides  both  integers.  In 
other  words,  it  is  a  statement  about  being  a  common  divisor. 

(b)  Condition  (ii)  says  that  every  other  positive  integer  that  divides  both  m  and  n 
also  divides  the  gcd(m,n),  and  therefore  is  a  factor  of  it.  In  other  words,  the 
second  condition  explains  the  choice  of  the  word  “greatest.” 

(c)  We  will  need  to  prove  the  uniqueness  of  the  greatest  common  divisor.  To 
do  so,  we  will  prove  that  if  there  are  integers  d\  and  dz,  both  satisfying  the 
definition  of  greatest  common  divisor,  then  d\  =  di. 

Solution  (27.5).  We  list  the  answers  here:  gcd(  — 16,40)  =  8,  gcd(0,45)  =  45,  and 
gcd(— 30,  — 27)  =  3. 

Solution  (27.7).  The  integers  m  and  n  are  given  and  n  >  1.  By  Theorem  27.3,  there 
are  q,  r  £  Z  such  that  m  =  nq  +  r  and  0  <  r  <  n.  Hence  m  —  r  =  nq  for  some  q  £  Z. 
Thus  m  =  r  (mod  n)  and  0  <  r  <  n. 

Solution  (27.10).  This  is  not  well-defined.  Let  n  =  5.  Then  2  =  7  (mod  5).  Now 
[3]P15  =  [4] 5  and  [3]{-71s  =  [2]5,  but  4  ^  2  (mod  5). 
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Solution  (27.13). 

(a)  Check  that  3_1  =  5  (mod  7),  5-1  =  3  (mod  7),  and  6_1  =  6  (mod  7). 

(b)  By  Corollary  27.1 1,  the  integers  1  and  5  have  reciprocals  modulo  6;  it  is  easy 
to  check  that  none  of  the  others  does. 


Problems 

Problem  27.1.  Let  a,b,c,x,  and  y  £  Z.  Prove  that  if  a\b  and  ci\c,  then  a \  ( bx  +  cy) . 

Problem  27.2.  Let  a  and  b  be  positive  integers  such  that  a\b  and  b\a.  Prove  that 
a  =  b. 

Problem  27.3.  Prove  Theorem  27.2.  (Note  that  this  generalizes  part  (c)  of  Prob¬ 
lem  10.2.) 

Problem  27.4.  Carefully  read  the  definition  of  greatest  common  divisor.  What 
should  the  least  common  multiple  of  two  integers  be?  Make  up  a  definition  for  it. 
The  least  common  multiple  of  two  integers  m  and  n  is  denoted  by  lcm (m,n). 

Problem  27.5.  Using  your  definition  from  Problem  27.4  and  the  notation  defined 
above,  prove  that  if  m  and  n  are  positive  integers,  then 

gcd(m,n)  -lcm(w,n)  =  mn. 

Problem  27.6.  Let  m,n  £  Z,  not  both  zero.  Suppose  that  a  is  an  integer  that  divides 
both  m  and  n  and  whenever  s  is  an  integer  dividing  both  m  and  n,  then  s  <  a.  Prove 
that  a  =  gcd(»i,n). 

Problem  27.7.  Let  m,n  £  Z  and  assume  that  m  /  0.  Prove  the  following  state¬ 
ments. 

(a)  For  all  positive  integers  k,  we  have  gcd (mk,nk)  =  k  gcd(m,n). 

(b)  If  d  =  gcd(m,n)  and  k.l  £  Z,  then  gcd(m,n)|  gcd(m  +  kd,n  +  Id). 

Problem  27.8.  Let  p  be  a  prime  number  and  a, b  £  Z.  Prove  that  if  p\ab,  then  p\a 
or  p\b.  (You  may  use  Theorem  27.6  to  solve  this  problem.) 

Problem  27.9.  Here  is  another  theorem  that  you  have  been  using  for  a  long  time. 

Theorem  27.14  (Fundamental  theorem  of  arithmetic).  Every  integer  n,  where 
n  >  2,  is  the  product  of  prime  numbers.  This  factorization  is  unique,  up  to  the  order 
of  the  factors. 

If  you  worked  Problem  18.21,  then  you  already  proved  that  such  a  factorization 
exists.  You  will  still  need  to  prove  uniqueness  of  the  factorization.  (You  may  use 
induction  and  the  result  of  Problem  27.8  to  do  so.) 
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Problem  27.10.  Let  p  >  1  be  an  integer  with  the  property  that  for  all  integers  a  and 
b,  if  p\ab,  then  p\a  or  p\b.  Prove  that  p  is  prime. 

(In  your  future  mathematics  courses  you  will  see  that  this  is  a  more  useful  defi¬ 
nition  of  prime  than  the  one  to  which  you  have  become  accustomed.) 

Problem  27.11.  Let  n  >  1  be  an  integer. 

(a)  Define  g:Znx  Z„  — > Z„  by  g([r]n,  [s]„)  =  [r+s]„.  Prove  that  g  is  well-defined. 

(b)  Define  h :  Z„  xZ„->  Z„  by  h([r\n,  [$]„)  =  [r  —  s]„.  Prove  that  h  is  well-defined. 

Problem  27.12.  Define  / :  Z12  — >•  Z24  by  /([*]  12)  =  [3x]24-  Is  /  well-defined?  Prove 
your  claim. 

Problem  27.13.  Let  p  be  an  odd  prime  and  define  /  :  Zp  — >  Zp  by  f([x\p)  =  [2x] p . 
Prove  that  /  is  well-defined  and  that  /  is  a  bijection. 

Problem*  27.14.  Let  n  >  1  be  an  integer.  Using  the  addition  and  multiplication  de¬ 
fined  on  Z„  in  this  chapter,  prove  the  following  statements: 

(a)  ([a]n  +  [b\„)  +  [c]n  =  [a\n  +  ([b\n  +  [c]„)  for  all  a,b,c£  Z; 

(b)  there  is  an  integer  0  G  Z  such  that 

(i)  [a]n  +  [0]„  =  [0]„  +  [a]„  =  \a]n  for  all  a  €  Z,  and 

(ii)  for  every  a  £  Z,  there  is  b  £  Z  such  that  [a\n  +  [b\n  =  [b]n  +  [a\n  =  [0]„; 

(c)  [a)n  +  [ b]n  =  [b\n  +  [a]„  for  all  a,  b  G  Z; 

(d)  \[a}n  ■  [b\n)  ■  [c]n  =  [a}„  ■  ([b\n  ■  [c]„)  for  all  a,b,c  £  Z; 

(e)  [a}„  ■  (\b\n  +  [c]„)  =  [a\n  ■  [b\n  +  [a}„  ■  [c]„  and 

{[a]n  +  \b\n)  ■  [c]n  =  [«]n  •  [c]n  +  [b]n '  [c]n  for  all  a,b,c  G  Z; 

(f)  there  is  an  element  e  G  Z  such  that  [a]n  ■  [e]n  =  [e]„  ■  [a\„  =  [a]„  for  all  a  G  Z; 

(g)  [a]n  ■  [ b\n  =  \b\n  ■  [a]„  for  all  a,beZ. 

(A  set  with  two  well-defined  operations  satisfying  (a)-(g)  is  called  a  “commutative 
ring  with  identity.”) 

Problem  27.15.  You  may  know  the  following  criterion  for  divisibility  by  nine:  A 
positive  integer  is  divisible  by  nine  if  and  only  if  the  sum  of  its  digits  is  divisible  by 
nine. 

(a)  Prove  this  criterion. 

(b)  Is  47832973  divisible  by  9? 

A  variation  of  the  divisibility  criterion  is  the  calculation  check  called  “casting  out 
nines.”  We  may  use  this  to  check  an  addition,  subtraction,  multiplication,  or  division 
(turning  the  division  into  a  multiplication  before  checking)  of  integers  by  replacing 
each  integer  by  the  sum  of  its  digits  (repeating  the  summation  until  the  integer  is  a 
single  digit).  If  our  computation  does  not  pass  this  check,  it  was  incorrect.  (However, 
it  may  pass  the  check  and  still  be  an  incorrect  computation!)  For  instance:  To  check 
that 

56782351912/25785  =  2202146  plus  a  remainder  of  17302, 
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we  first  write  this  as 


56782351912  =  25785-2202146+17302. 

Now  we  “cast  out  nines”: 

4  =  0  •  8  +  4, 

and  we  see  that  our  “check”  is  consistent  with  the  process  of  casting  out  nines. 
However,  if  we  had 


56782341912  =  25785-2202146+17202, 
our  “check”  would  yield 

4  =  0-  8  +  3, 

and  we  would  be  alerted  to  an  error. 

(c)  Explain  “casting  out  nines”  (use  your  work  from  part  (a)). 

(d)  Find  an  example  that  shows  that  it  is  possible  to  have  a  situation  in  which  the 
check  works,  but  the  original  computation  is  incorrect. 

Problem  27.16.  Let  n  be  an  integer  with  n  >  1 .  Prove  that  the  following  are  equiv¬ 
alent. 

1.  For  all  m,  if  m  ^  0  (mod  n),  then  m  has  a  reciprocal  modulo  n. 

2.  The  integer  n  is  prime. 

(This  makes  Zp,  where  p  is  prime,  as  good  a  set  to  do  arithmetic  in  as  Q.  As  far 
as  multiplication  is  concerned,  Zp  is  better  than  Z,  because  very  few  numbers  (two, 
to  be  exact)  in  Z  have  reciprocals  that  also  lie  in  Z  .  The  set,  Zp,  for  p  a  prime,  is 
what  is  called  a  “field”  in  mathematics.  Other  examples  include  Q,R,  and  C.  The 
set  Z  is  not  a  field.) 

Problem  27.17.  (This  problem  is  appropriate  only  if  you  studied  Chapter  22.)  Use 
the  result  of  Exercise  27.7  to  show  that  |Z„|  =  n. 

Problem  27.18.  Find  all  solutions  in  Z„  for  the  following  equivalences: 

(a)  3a  =  0  (mod  12); 

(b)  3a-  =  0  (mod  17); 

(c)  3a  =  0  (mod  10). 

Problem  27.19.  Find  all  solutions  in  Z„  for  the  following  equivalences: 

(a)  4a  =  1  (mod  11); 

(b)  4a  =1  (mod  9); 

(c)  3a  =  1  (mod  11); 

(d)  3a  =1  (mod  9). 
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Problem*  27.20.  Here’s  a  brief  explanation  of  the  Euclidean  algorithm,  which  is 
an  effective  way  to  find  the  greatest  common  divisor  of  two  integers  m  and  n,  not 
both  zero.  This  algorithm  is  in  the  seventh  book  of  Euclid’s  Elements,  but  was  likely 
known  earlier. 

There  are  two  trivial  cases  that  must  be  considered  before  moving  to  the  inter¬ 
esting  one.  If  m  =  n,  then  the  greatest  common  divisor  is  obviously  \m\.  If  one  of 
the  integers  is  zero  (remember  that  both  can’t  be  zero),  then  the  greatest  common 
divisor  is  the  absolute  value  of  the  nonzero  integer.  Now  for  the  main  case,  note  that 
the  positive  divisors  of  an  integer  m  are  the  same  as  the  ones  of  —in.  For  this  reason, 
we  may  assume  that  both  m  and  n  are  positive.  After  possible  relabeling  of  the  two 
numbers,  we  may  further  assume  that  m>  n>  0. 

The  Euclidean  algorithm  is  a  repeated  application  of  the  division  algorithm.  The¬ 
orem  27.3.  Each  line  is  obtained  from  the  previous  one  by  shifting  the  divisor  to  the 
spot  previously  occupied  by  the  dividend,  and  the  remainder  to  the  spot  previously 
occupied  by  the  divisor.  It’s  easier  to  see  than  to  say.  Here  is  the  way  to  see  it: 

m  =  q\n  +  r\ , 
n  =  qm+r2, 

=  <?3r2  +  r3j 


rk—3  =  qk-m-i  +  rk-i, 
rit-2  =  qkn-\  +  rk , 

rit-i  =  qk+m- 

By  the  division  algorithm,  the  remainders  satisfy  the  inequalities 
n  >  r\  >  . . .  >  n  >  ri+i  >  . . .  >  0. 

This  guarantees  that  the  algorithm  comes  to  a  halt  after  finitely  many  steps.  We  label 
the  last  nonzero  remainder  rk  and  solve  for  rk  as  follows: 


n  =  n-2-qkrk-i 

=  rk-2  - qk(rk-3  - qk-m-i)  =  -qkrk-3  +  {\  +  qkqk-\)rk-2 


=  xom  +  yon. 

It  can  be  shown  (but  we  won’t  ask  you  to  do  it)  that  rk  =  gcd (m,n). 

We’ll  work  out  one  example  for  you,  so  you  can  see  how  this  is  done.  We  will 
find  the  greatest  common  divisor  of  8  and  27  and  express  it  as  a  linear  combination 
of  the  given  integers.  Now  we  need  m  >  n,  so  m  =  27  and  n  =  8.  We  now  proceed 
with  the  algorithm.  The  remainders  are  underlined,  and  will  be  replaced  with  what 
we  obtained  in  the  column  on  the  left. 
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27  =  3-8  +  3 
8  =  2-3  +  2 
3  =  1-2  +  1 
2  =  2-1 


so 


1=3  — 1-2 

=  3  —  1  •  (8  -2  -  3)  =-8  +  3-3 

=  -8  +  3(27-3-8)  =  3-27-  10-8 


So  our  algorithm  tells  us  that  1  =3-27  —  10-8,  and  you  can  now  check  that  this 
answer  is  correct. 

You’ll  understand  the  algorithm  better  if  you  use  it  to  calculate  the  gcd  of  two 
numbers.  Do  so  for  the  following  pairs  of  integers  (m,n)  and  find  the  corresponding 
integers  xq  and  yo: 

(a)  (2745,135); 

(b)  (528,627); 

(c)  (4746,894). 


Problem  27.21.  Use  the  Euclidean  algorithm  of  Problem  27.20  to  show  that  2542 
and  4095  are  relatively  prime. 

Problem  27.22.  On  a  calculator  or  a  computer,  program  the  Euclidean  algorithm  as 
outlined  in  Problem  27.20.  Check  your  program  by  trying  it  out  on  parts  (a)  through 
(c)  in  that  problem. 

Problem  27.23.  In  the  text  we  defined  what  it  means  for  an  integer  p  to  be  prime. 
We  also  defined  what  it  means  for  two  integers  a  and  b  to  be  relatively  prime.  Give  an 
alternate  definition  for  an  integer  p  to  be  prime  by  requiring  a  and  p  to  be  relatively 
prime  for  certain  integers  a.  Prove  that  the  original  and  the  alternate  definition  of 
prime  are  equivalent. 

Problem  27.24.  It  is  possible  to  define  a  function  /  that  tells  you  the  day  of  the 
week  your  birthday  will  fall  on  each  year.  To  construct  such  a  function,  you  need  to 
find  out  what  day  of  the  week  you  were  born.  (Encode  the  weekdays  as:  0 — Sunday, 
1 — Monday,  and  so  on.)  Letting  s  denote  the  encoded  week  day  of  your  birth,  a 
the  year  you  were  born,  and  b  the  year  in  which  you  want  to  know  the  weekday  of 
your  birthday,  you  will  need  to  define  /  in  terms  of  s,  a,  and  b.  Thus,  the  required 
function  /  will  be  a  map  from  Z7  x  Z  x  Z  into  Z7.  (The  formula  will  depend  on 
whether  your  birthday  is  before,  after,  or  on  February  29  of  your  birth  year.) 

Use  modular  arithmetic,  but  keep  in  mind  that  there  are  leap  years.  (The  year 
2000  was  a  leap  year.  The  formula  becomes  considerably  more  complicated  if  you 
want  to  extend  it  past  2100,  because  that  year  will  not  be  a  leap  year.) 

To  find  out  the  day  of  your  birth  and  to  check  your  formula,  access  one  of  the 
perpetual  calendars  on  the  Web  such  as  [38]. 


Chapter  28 

Fermat’s  Little  Theorem 


We  begin  this  chapter  with  a  fundamental  result  of  number  theory,  discovered  by 
Pierre  de  Fermat.  Fermat  lived  from  1601  to  1665.  Many  of  his  contemporaries  were 
“number-lovers”  rather  than  number  theorists,  [108,  p.  51],  and  one  thing  that  inter¬ 
ested  them  was  perfect  numbers  (a  number  is  perfect  if  it  is  the  sum  of  all  its  proper 
divisors).  Bernard  Frenicle  de  Bessy,  who  was  also  a  mathematician  and  physicist, 
first  raised  the  question  of  whether  there  was  a  perfect  number  of  20  digits  and,  if 
not,  what  the  next  largest  perfect  number  was.  (See  [29]  and  [30].)  The  answer  to  the 
question  required  determining  whether  certain  large  numbers  were  prime.  As  a  con¬ 
sequence,  the  men  began  corresponding.  In  a  letter  to  Frenicle,  dated  October  18, 
1640,  Fermat  stated  what  is  now  known  as  Fermat’s  theorem  or  Fermat’s  little  the¬ 
orem  (to  distinguish  it  from  Fermat’s  last  theorem),  but  he  did  not  include  a  proof. 
In  1736,  almost  a  century  later,  Leonhard  Euler  gave  the  first  rigorous  proof  of  the 
little  theorem.  Though  this  theorem  is  clearly  theoretical  in  nature,  it  plays  an  im¬ 
portant  role  in  primality  testing;  that  is,  in  deciding  whether  or  not  a  certain  number 
is  prime.  Fermat’s  little  theorem  (in  the  form  due  to  Euler)  is  also  the  mathematical 
heart  of  the  widely  used  RSA  code  that  we  will  describe  later  in  this  chapter.  In  fact, 
Fermat’s  little  theorem  is  not  little  at  all. 

Theorem  28.1  (Fermat’s  Little  Theorem).  Let  p  be  a  prime  and  let  a  be  an  integer 
satisfying  gcd (a,p)  =  1.  Then 


ap  1  =  1  (mod  p). 


Exercise  28.2.  Verify  Theorem  28.1  for  a  few  values  of  p  and  a.  Q 

We  will  state  and  prove  Euler’s  generalization  of  this  theorem  below.  Fermat’s 
little  theorem  will  then  follow  as  a  special  case. 

In  order  to  state  the  form  of  the  theorem  that  we  will  prove,  we  will  introduce  a 
new  function:  Euler’s  0-function  is  the  function  0  :  Z+  — >  Z+  where  (pin)  denotes 
the  number  of  integers  k,  with  0  <  k  <  n,  that  are  relatively  prime  to  n. 
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Exercise  28.3.  Calculate  0(1), 0(12), 0(7), 0(13),  and  0(7  •  13).  O 

Exercise  28.4.  Show  that  if  p  is  prime,  then  0  (p)  =  p  -  \  .  Q 

The  following  lemmas  will  assist  us  in  our  proof  of  Theorem  28.7  below.  The 
first  lemma  requires  the  multiplication  in  Z„  that  we  defined  in  Chapter  27. 

Lemma  28.5.  Let  n  and  a  be  integers  satisfying  n  >  1  and  gcd(a,n)  =  1.  If  r  and  s 
are  integers  satisfying  ar  =  as  (mod  n),  then  r  =  s  (mod  n). 

Proof  Since  gcd(a,«)  =  1,  we  may  apply  Corollary  27.11  to  obtain  an  integer  b 
such  that  ab  =  1  (mod  n ).  We  now  multiply  the  equivalence  ar  =  as  (mod  n)  by 
b  to  get  arb  =  asb  (mod  n).  Using  commutativity  of  the  multiplication  (see  Prob¬ 
lem  27.14)  and  simplifying,  we  obtain  r  =  s  (mod  n),  as  desired.  □ 

We  summarize  much  of  what  we  have  learned  below. 

Lemma  28.6.  Let  a  and  n  be  integers  with  n  >  1  and  gcd (a,n)  =  1.  Then  there  exist 
exactly  0  (n)  distinct  integers,  m\ , m2, . . . , m^p,)  such  that 

(i)  0  <  m,-  <  n  and  gcd (m,-,n)  =  1  for  i  =  1, . . . ,  0(n), 

(ii)  there  exists  c  €E  Z  such  that 

/m  \ 

FT  m'  c  1  (mod  n),  and 

V'=l  / 

(Hi)  anii^anij  (mod  n)  for  i  f  j. 

Proof  By  the  definition  of  Euler’s  0-function,  there  exist  exactly  0(n)  distinct  in¬ 
tegers  satisfying  (i). 

Using  Problem  28.1,  it  follows  from  property  (i)  that  gcd(nfi"1  m,-,n)  =  1.  Thus 
we  may  apply  Corollary  27.1 1  to  obtain  an  integer  c  such  that 


/>(»)  \ 

FT  m‘  c  =  1  (mod  n), 


completing  the  proof  of  (ii). 

For  part  (iii)  recall  thatgcd(a,n)  =  1.  Now  m\ , . . .  ,m^(„)  are  distinct  integers  with 
0  <  <  n  for  each  k.  So,  if  i  f  j,  then  in,  f  mj  (mod  n) .  Thus  (the  contrapositive 

of)  Lemma  28.5  implies  that  am,  ^  am:  (mod  n)  for  i  ^  j.  □ 

Now  we  are  ready  for  Euler’s  generalization  of  Fermat’s  little  theorem. 

Theorem  28.7  (Euler’s  Theorem).  Let  a,n  £  Z  with  n  >  1.  If  gcd(a,«)  =  1,  then 


a 


=  1  (mod  n). 
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Proof.  Let  1111,1112 , . . .  ,m^ >(„)  be  as  in  Lemma  28.6.  Then  am\,ani2,  ■  ■  ■  ,am^n^  are 
distinct  integers  (mod  n),  and  gcd (m,,n)  =  1.  Note  that  since  there  are  (j>(n)  inte¬ 
gers,  we  have 

0(«)  <K”) 

a<Kn)  m.  =  YY^amj').  (28.1) 

i=  1  i=  1 

Thus,  if  we  can  find  IT f=i(ami)  as  well  as  the  reciprocal  of  flfi”*  w,-  (mod  n),  we 
can  also  compute  (mod  n). 

To  compute  the  product  in  equation  (28.1),  use  Exercise  27.7  to  obtain  <j>(n)  inte¬ 
gers,  .51,52,  •  •  •  ,S0(n)  such  that  Si  =  anij  (mod  n)  and  0  <  s,  <  n.  Now,  gcd(a,n)  =  1 
and  gcd (m,-,«)  =  1,  so  by  Problem  28.1,  gcd(am,,n)  =  1.  Thus,  Lemma  27.12  im¬ 
plies  that  gcd(s;,n)  =  1.  We  have  found  <j>(ii)  different  integers,  si,S2,  ■  ■  ■ 
all  relatively  prime  to  n.  So  ,V|  ,52. . . .  ,  is  simply  a  (possible)  reordering  of 
mi, m2,  ■  ■  ■  ,m^ny  Consequently 


<A(«)  <Hn)  0(«) 

^Y(anij)  =  Si  =  nii  (mod  n).  (28.2) 

i=  1  i=l  i=l 

Combining  (28.1)  and  (28.2)  we  get 

</>(")  0(«) 

a*{">  n  m,  =  n  m<  (mod  n).  (28.3) 

(=1  i=i 

By  Lemma  28.6,  there  is  an  integer  c  such  that  (Ilfii*  mi)c  =  1  (mod  n).  So,  mul¬ 
tiplying  both  sides  of  (28.3)  by  c,  we  obtain 

/  <#>(«)  \  /  <^>(«)  \ 

(  J“j  mj  \  c  =  I  »(;  I  c  (mod  /1). 

Using  associativity  and  simplifying,  we  obtain  a^(”)  =  1  (mod  n),  as  desired.  □ 

Since  Fermat’s  little  theorem  is  a  special  case  of  Euler’s  theorem,  with  n  =  p  for 
a  prime  p  and  (j)  ( p )  =  p  —  1,  we  now  have  a  proof  of  Theorem  28. 1  as  well. 

One  interesting  application  of  Euler’s  theorem  is  in  an  area  of  mathematics 
known  as  coding  theory.  Here’s  the  idea:  Suppose  you  want  to  transmit  a  message 
to  a  receiver,  whom  we  shall  refer  to  as  Henry,  in  such  a  way  that  no  one  else  can 
read  it.  This  is  done  all  the  time.  (Just  think  how  often  you  have  sent  your  credit 
card  number  over  the  Internet!)  The  idea  is  to  use  a  code  that  is  difficult  to  decode. 
But  of  course,  if  it’s  too  difficult,  Henry  won’t  be  able  to  decode  it  either.  Applying 
the  code  to  our  secret  message  is  like  applying  a  function.  Henry  needs  to  undo  the 
code,  or  mathematically  speaking,  apply  the  inverse  function.  So  we  need  something 
like  a  function  that  has  an  inverse,  but  whose  inverse  is  very  difficult  to  find.  Such 
functions  are  called  trapdoor  functions.  (Anyone  can  get  in,  but  only  Henry  can  get 
out.)  Since  it  is  virtually  impossible  to  find  the  inverse  function,  the  original  func- 
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tion  used  to  hide  the  message  can  be  made  public.  This  is  achieved  using  a  method 
called  a  public  key  encryption,  which  includes  a  public  key  and  a  private  key. 

One  particular  trapdoor  function  leads  to  the  following  method.  Henry,  the  re¬ 
ceiver  of  the  messages,  decides  on  a  function  that  is  determined  by  the  two  integers, 
n  and  <?,  called  the  key  of  the  code.  Anybody  who  is  interested  can  learn  about  these 
two  numbers  (this  is  why  it  is  a  public  key  encryption).  If  you  want  to  send  a  mes¬ 
sage  to  Henry,  then  you  first  turn  the  English  text  into  a  positive  integer  m,  called  the 
plaintext.  There  are  standard  ways  to  do  this,  but  the  number  produced  does  not  yet 
hide  the  message.  (If  the  translation  leads  to  a  number  m  that  is  greater  than  n,  the 
message  must  be  divided  into  several  smaller  messages.)  The  plaintext,  m,  must  now 
be  scrambled  so  that  its  meaning  cannot  be  deciphered  by  anyone  except  Henry.  Or, 
mathematically  speaking,  we  have  to  apply  the  trapdoor  function  to  it.  A  simple  but 
very  safe  way  to  do  this,  is  to  change  m  to  me  (mod  n).  It’s  interesting  to  note  that 
though  it  appears  that  everyone  has  all  the  information  Henry  has,  it  turns  out  that 
Henry  knows  something  no  one  else  knows.  We’ll  explain  this  once  we  tell  you  how 
Henry  will  unscramble  the  message.  So  the  question  is:  How  can  Henry  recover  m 
from  me  (mod  n)l  It  turns  out  that  he  will  use  Euler’s  theorem.  Here’s  how: 

Example  28.8.  Let  m,n,  and  e  be  positive  integers  satisfying  n  >  1,  gcd (m,n)  =  1, 
and  gcd (<?,0(n))  =  1.  Find  a  positive  integer  d  such  that  (me)  =  m  (mod  n). 

Why  is  this  example  relevant?  Well,  once  you  have  d,  you  have  me,n,e,  and  d. 
You  can  then  calculate  (me)d ,  which  (as  we  learned  in  this  example)  is  equivalent  to 
m  modulo  n. 

In  order  for  the  solution  to  this  problem  to  be  useful,  we  need  a  constructive  way 
to  find  d.  We  claim  that  (i)  we  can  find  an  integer  d  such  that  e  d=  1  (mod  <\>(n)) 
and  that  (ii)  any  such  integer  will  fulfill  the  requirement  ( me)d  =  m  (mod  n). 

Now  since  e  is  a  positive  integer  relatively  prime  to  (j)  in).  Theorem  27.6  guaran¬ 
tees  the  existence  of  integers  k  and  /  such  that  1  =  ke  + 1  (j)(n).  Let  d  be  the  smallest 
positive  integer  such  that  d  =  k  (mod  0(«)).  Then  1  =  de  (mod  0(«)),  which  is 
what  we  needed  to  show. 

For  part  (ii),  calculate  ( me)d  =  med,  and  recall  that  m =  l  (mod  n),  by  Euler’s 
theorem.  We  just  showed  that  1  =  ed  +  p  (n)  for  some  j  e  Z,  so 

med  =  7B1-70(n)  =  m  ■  =  m  •  1  =  m  (mod  n ). 

Note  that  in  Problem  27.20  we  gave  a  constructive  method  to  find  the  integers  k 
and  /  used  above,  and  in  Exercise  27.7,  you  showed  how  to  get  the  integer  d  from 

k.  O 

Now  back  to  Henry.  Remember  that  he  has  determined,  rather  carefully,  his  n  and 
e  and  has  given  out  these  two  integers.  He  also  calculated  the  very  important  integer 
d  from  Example  28.8,  but  kept  it  a  secret.  Now  you  may  well  be  asking  the  question, 
“Why  can’t  everyone  with  access  to  n  and  e  calculate  cl  themselves,  and  then  read 
the  messages  meant  for  Henry?”  The  reason  is  that  in  order  to  find  d,  a  person  needs 
to  know  the  modulus  that  determines  d ,  namely,  @(n).  Henry  knows  (as  you  will 
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once  you  work  Problem  28. 12)  that  if  he  chooses  n  such  that  it  is  the  product  of  two 
primes  p\  and  p2,  then  0(n)  =  ( p\  —  l)(/?i  —  1).  So  he  lets  p\  and  p2  be  two  primes, 
each  about  300  digits  long,  following  current  recommendations.  (He  must  be  a  little 
careful  choosing  p\  and  p2,  but  we  will  not  go  into  that  here.)  Now  he  and  everyone 
else  knows  the  product  n,  but  not  p\  and  p2-  This  is  the  trapdoor.  Henry  knows  n,p i, 
and  p2 .  So  he  can  find  0(11).  But  everyone  else  only  knows  n,  so  they  would  have 
to  find  pi  and  p2-  It  takes  no  time  at  all  to  multiply  two  300-digit  numbers,  but  you 
cannot  factor  the  product  in  a  million  years,  not  even  with  supercomputers!  There  is 
a  second  reason  that  Henry  had  to  know  0(«):  he  needed  e  to  be  relatively  prime  to 
000- 

There  is  one  more  thing  that  we  should  mention.  In  Example  28.8  we  have 
the  additional  condition  gcd (m,n)  =  1  (see  also  Problem  28.19).  Thus,  it  appears 
from  our  solution  above  that  we  also  need  gcd (m,n)  =  1.  Fortunately  our  decod¬ 
ing  method  still  works  if  we  have  n  =  p\P2  for  two  different  primes  p  \  and  pi  and 
gcdfm.  n)  =  p\.  If  you  work  Project  29.13,  you  will  prove  this  in  Lemma  29.16. 

Henry’s  method  to  get  secure  messages  is  called  the  RSA  public  key  encryp¬ 
tion,  and  Example  28.8  is  the  mathematical  content  of  it.  To  learn  more  about  this 
ingenious  and  widely  used  method,  work  Project  29.13  on  the  RSA  Code. 


Definition 

Definition  28.1.  Euler’s  0  -function  is  the  function  0  :  Z+  — >  Z+  where  0(11)  de¬ 
notes  the  number  of  integers  k,  with  0  <  k  <  n,  that  are  relatively  prime  to  n. 


Solutions  to  Exercises 

Solution  (28.2).  We  calculate  two  examples. 

1.  Let  p  =  5  and  a  =  7.  Then  74  =  2401  =  1  (mod  5). 

2.  Let  p  =  13  and  a  =  8.  Then  812  =  68719476736.  Now  8 12  -  1  =  68719476735 
=  13  •  5286113595.  Hence  812  =  1  (mod  13). 

Solution  (28.3).  0  ( 1 )  =  1 . 

The  nonnegative  integers  smaller  than  12  are  all  listed.  We  cross  out  the  ones  that 
are  not  relatively  prime  to  12:  0,l,^,^,jf,5,$,7,$,(l,l/)ll.  Hence  0(12)  =4. 
Similarly,  0(7)  =  6, 0(13)  =  12,  and  0(7  •  13)  =  0(91)  =  72. 

Notice  that,  in  these  examples,  for  p  and  q  different  primes  0  ( p )  =  p  —  1  and 
0  (p '  l)  =  0  (p)  '  0(?)-  Are  these  coincidences? 

Solution  (28.4).  Note  that  for  p  prime  and  a  an  integer  with  0  <  a  <  p,  we  have 
gcd(a,p)  =  1  if  and  only  if  a  ^  0.  Thus  0  ( p)  =  p  —  1  for  every  prime  p. 
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Spotlight:  Public  and  Secret  Research 

Research  in  mathematics  today  is  often  done  by  professors  who  work  at  universi¬ 
ties  or  colleges.  People  frequently  work  collaboratively,  though  they  also  sometimes 
work  alone.  They  might  communicate  via  e-mail,  get  together  when  they  can,  work 
together  at  institutes,  or  they  may  never  even  meet  each  other.  Once  their  work  is 
done,  they  write  it  up  and  send  it  to  a  journal.  The  editor  of  the  journal  sends  it  to 
carefully  selected  referees  who  read  the  paper.  The  author  is  responsible  for  the  cor¬ 
rectness  of  the  mathematics  in  the  paper,  but  the  referee  (whose  identity  is  generally 
hidden  from  the  author)  determines  the  value  of  the  work,  the  appropriateness  of  its 
placement  in  the  journal,  the  originality  of  the  mathematics,  and  often  the  correct¬ 
ness  of  the  results.  Once  the  paper  appears,  everyone  has  access  to  the  results  and 
proofs  in  the  paper. 

There  are  also  other  places  where  mathematical  research  is  done.  In  the  United 
States,  the  National  Security  Agency  (NSA)  refers  to  itself  as  the  “leading  employer” 
of  nonacademic  mathematicians.  In  Great  Britain,  there  is  the  Government  Commu¬ 
nications  Headquarters  (GCHQ),  the  successor  to  the  famous  Bletchley  Park  where 
British  code  breakers  were  so  successful  in  intercepting  and  reading  Nazi  attack 
plans.  The  mathematics  done  in  a  place  like  this  might  become  the  government’s 
secret,  and  therefore  may  never  be  published.  Public  key  encryption  is  an  example 
of  how  such  secrecy  may  hamper  mathematical  progress. 

In  1976,  Whitfield  Diffie,  Martin  Heilman,  and  Ralph  Merkle  developed  the  idea 
of  the  public  key.  In  1977,  Ronald  Rivest,  Adi  Shamir,  and  Leonard  Adleman  gave 
us  the  RSA  code.  A  couple  of  decades  later,  it  became  known  that  British  mathe¬ 
maticians  at  GCHQ  had  worked  out  the  encryption  idea  a  few  years  earlier.  James 
Ellis,  Clifford  Cocks,  and  Malcolm  Williamson,  all  employed  at  GCHQ,  had  discov¬ 
ered  public  key  encryption,  but  neither  they  nor  their  supervisors  realized  the  power 
and  widespread  applicability  of  the  method.  Their  results  were  considered  top  se¬ 
cret  and  were  only  circulated  within  the  agency.  A  few  years  later,  the  world  admired 
the  “new”  cryptosystem  and  celebrated  the  “originators”  in  the  United  States,  [99, 
Chapter  6]. 

Some  private  companies  also  restrict  their  employees’  publications.  There  are 
instances  of  researchers  circumventing  this  restriction  by  publishing  under  a  pen- 
name.  In  1908,  William  S.  Gosset,  who  worked  for  the  Guiness  Brewing  Company 
in  Dublin,  published  a  paper  under  the  name  “Student”  to  avoid  repercussions.  The 
distribution  Gosset  introduced  is  now  known  as  Student’s  t- distribution  (or  the  Stu¬ 
dent  t-distribution)  and  it  has  had  a  profound  impact  on  statistical  theory  and  prac¬ 
tice. 

To  learn  more  about  the  interesting  history  of  public  key  encryption  we  recom¬ 
mend  [99,  Chapter  6]  or  [61]. 

An  in-depth  treatment  of  Fermat’s  little  theorem,  Euler’s  theorem,  and  Euler’s 
(j) -function,  as  well  as  historical  notes  can  be  found  in  the  text  [15,  pp.  91-96  and 
123-150].  In  [57,  pp.  418-420  and  556-558]  Fermat’s  theorem  and  Euler’s  theorem 
are  put  in  context.  For  short  biographies  of  Pierre  de  Fermat  and  Leonhard  Euler, 
see  the  Web  at  [79].  For  a  biographical  sketch  of  Euler  and  a  delightful  description 
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of  Euler’s  mathematics  in  the  various  fields,  see  [22].  A  good  source  to  learn  more 
about  Fermat’s  life  and  his  mathematics  is  [66]. 


Problems 

Problem*  28.1.  (a)  Let  a,  b,  such  that  gcd  (a,  s)  =  1  and  gcd(b,s)  =  1.  Show 

that  gcd [ab,s)  =  1. 

(b)  Let  n  £  Z+  and  and  s  be  integers  such  that  gcd(ayt,s)  =  1  for  all 

integers  k  with  1  <  k  <  n.  Show  that  gcd(n^=i  ak,s)  =  1. 

Problem  28.2.  Show  that  the  conclusion  of  Fermat’s  little  theorem  (Theorem  28.1) 
may  not  hold  if  p  is  not  prime. 

Problem  28.3.  Use  Fermat’s  little  theorem  to  show  that  for  p  a  prime,  every  integer 
a  with  a  ^  0  (mod  p)  has  a  reciprocal  modulo  p. 

Problem  28.4.  In  Problem  28.3  above  you  proved  that  if  p  is  a  prime,  then  every 
integer  that  is  not  equivalent  to  zero  modulo  p  has  a  reciprocal  modulo  p. 

(a)  Find  the  reciprocals  of  1 , 2, . . . ,  6  modulo  7. 

(b)  Given  a  prime  p ,  find  all  integers  x  with  1  <  x  <  p  that  are  the  reciprocals  of 
themselves. 

(c)  Prove  the  following  theorem: 

Theorem  28.9  (Wilson’s  theorem).  If  p  is  a  prime,  then  (p—  1)!  =  —  1  (mod  p). 

Problem  28.5.  Prove  the  converse  of  Wilson’s  theorem,  namely:  For  an  integer 
n>l,  if(n—  1)!  =  —  1  (mod  n),  then  n  is  prime.  (Hint:  The  case  n  =  22  needs 
to  be  considered  separately.) 

Problem  28.6.  (a)  Let  p  and  q  be  primes  with  p  =/=  q,  and  let  a  and  b  be  two  inte¬ 

gers.  Prove  that  if  a  =  b  (mod  p)  and  a  =  b  (mod  q).  then  a  =  b  (mod  pq). 
(b)  Use  part  (a)  to  show  that  if  p  and  q  are  prime  numbers  with  p  ^  q,  and  a  is 
an  integer  satisfying  gcd (a,pq)  =  1,  ap  =  a  (mod  <7),  and  cfl  =  a  (mod  p), 
then  apq  =  a  (mod  pq). 

Problem  28.7.  In  this  problem,  we  show  that  the  following  converse  of  Fermat’s 
little  theorem  is  false:  If  a  is  a  prime  and  a"1'  =  1  (mod  m),  then  m  is  necessarily 

a  prime. 

Let  a  =  2,p  =  11,  <7  =  31,  and  m  =  pq.  Use  Problem  28.6  part  (b)  to  show  that 
the  statement  above  is  false.  (Note:  You  can  verify  this  counterexample  directly 
with  a  calculator  or  computer.  If  you  use  Problem  28.6  part  (b)  and  do  the  modular 
exponentiation  efficiently,  the  verification  can  be  done  easily  by  hand.  The  example 
is  from  [46].) 

Problem  28.8.  Let  a  and  n  be  integers  with  n  >  1  and  gcd (a,n)  =  1.  Prove  that  there 
exists  a  smallest  positive  integer  m  such  that  am  =  1  (mod  n)  and  m\<p(n). 
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Problem  28.9.  (a)  Calculate  0(52),0(53),  and  0( 54). 

(b)  For  p  a  prime  and  n  a  positive  integer,  show  that  0  (/?'')  =  /?"( 1  1  //?). 

(c)  Calculate  0(128). 

Problem  28.10.  Is  Euler’s  0-function  additive;  that  is,  for  all  m,n  £  Z+  is  it  the  case 
that  0(h;  +  h)  =  0(m)  +  0(h)?  Prove  it  or  give  a  counterexample. 

Problem  28.11.  This  problem  guides  you  through  the  proof  of  the  fact  that  Euler’s 
0-function  is  in  some  sense  multiplicative.  More  precisely,  you  will  prove 

Theorem  28.10.  Let  m  and  n  be  integers  such  that  m  >  1  and n  >  1.  //gcd(m,  n)  =  1, 
then  (j)(mn)  =  0(/n)0(n). 

(a)  Let  ki,k2,li,  and  U  be  integers  satisfying  0  <  k\,kn  <n  and  0  <  I1J2  <  m- 
Show  that  if  k\ m  +  l\n  =  k^m  +  /2H  (mod  mn ),  then  £1  =  ^2  and  l\  =  I2. 

(b)  Use  the  result  of  (a)  to  show  that  for  each  a  £  Z  there  is  exactly  one  element 
(k,l)  £  ZxZ  such  that  0<k<n,0<l<  m,  and  a  =  km  +  In  (mod  mn). 

(c)  For  positive  integers  m  and  n,  use  (b)  to  conclude  that 

|  {(&,/)  £ZxZ:0  <£<  n,  0  <  l  <  m,  and  gcd  (km  +  ln,mn)  =  1 }  |  =  <j)(mn). 

(d)  For  k,l  £  Z  with  0  <  k  <  n  and  0  <  l  <  m,  show  that  gcd (km  +  ln,mn)  =  1  if 
and  only  if  gcd (k,n)  =  1  and  gcd (l,m)  =  1. 

(e)  Use  (c)  and  (d)  to  obtain  the  conclusion  of  Theorem  28.10. 

Problem  28.12.  Use  the  results  of  Problems  28.9  and  28.11  to  answer  the  follow¬ 
ing. 

(a)  Let  m£h  and  suppose  m  =  p ■  pL£  ■  ■■■■  pL'kk,  where  pi,p2,  ■  ■  ■  ,Pk  are  distinct 
primes  and  a\ ,02,  ■  ■  ■  ,ak  are  positive  integers.  Prove  that 


(b)  Calculate  0(5712200). 

Problem  28.13.  Prove  that  for  n  £  Z+,  the  Euler  0-function  satisfies 


0(h)  if  n  is  odd 
20(h)  if  n  is  even 


Problem  28.14.  Use  the  formula  from  Problem  28.12  to  show  that  for  p  prime  and 
r  a  positive  integer,  Y.d\pr  0(<^)  =  Pr ■ 

Problem  28.15.  Let  a.b,  and  c  be  integers  such  that  gcd (a,b)  =  1  and  a\bc.  Prove 
that  this  implies  that  a\c. 


Problem  28.16.  Prove  Theorem  28.1  directly  by  adapting  the  proof  of  Theorem  28.7 
to  this  simpler  situation. 
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Problem  28.17.  In  each  of  the  two  cases  below,  use  the  method  of  Example  28.8  to 
find  an  integer  m  with  0  <  m  <  33  satisfying 

(a)  m3  =  8  (mod  33)  and  gcd(m,33)  =  1; 

(b)  to77  =  15  (mod  143)  and  gcd(;72, 143)  =  1. 

Problem  28.18.  Show  that  the  conclusion  of  Euler’s  theorem  may  not  hold  if 

gcd(fl,7i)  >  1. 

Problem  28.19.  Show  that  if  x  and  y  are  integers  and  p  is  a  prime,  then 

{x  +  y)p  =  xp  +yp  (mod  p). 

(You  may  find  the  binomial  theorem  useful  here.  If  so,  you  may  use,  without  proof, 
the  fact  that  the  binomial  coefficient  (")  is  an  integer  for  k,n  €  N  with  k  <  n.) 

Problem  28.20.  The  RSA  code  relies  on  being  able  to  calculate  me  (mod  n)  quickly. 
Fortunately,  this  is  possible  with  an  intelligent  plan  and  with  the  help  of  a  com¬ 
puter. 

(a)  Devise  a  plan  to  calculate  m100  with  the  smallest  number  of  multiplications  by 
multiplying  two  (and  only  two)  integers  at  a  time.  How  many  multiplications 
and  divisions  did  you  need?  (You  don’t  need  to  prove  that  the  number  of 
multiplications  you  require  is  the  smallest.) 

(b)  To  multiply  or  divide  two  integers  of  n  digits  each  we  need  to  do  roughly  n2 
operations.  If  m  and  n  have  600  digits,  how  many  operations  are  needed  to 
calculate  m 100  (mod  n)l 

(c)  Your  computer  is,  most  likely,  capable  of  performing  2  •  1010  operations  per 
second.  How  long  will  it  take  to  calculate  your  m100  (mod  n)l 

Problem  28.21.  The  RSA  code  can  also  be  used  as  a  signature.  In  doing  so,  the 
sender  sends  out  n  and  d  and  keeps  e  for  himself.  You  receive  two  messages,  one  is 
a  plain  message  and  the  other  is  an  encrypted  message.  You  can  check  whether  the 
two  messages  are  the  same,  because  you  have  n  and  d.  So,  if  the  encrypted  message 
unscrambles  to  the  plain  one,  then  it  must  come  from  the  sender — the  only  person 
who  knows  e. 

In  this  problem,  we’ll  look  at  a  specific  example:  The  public  signature  key  is 
n  =  77  and  d  =  13.  The  message  you  get  is  8  with  the  encrypted  form  50.  Decide 
whether  the  message  came  from  the  sender.  If  it  didn’t,  calculate  the  true  pair  of 
messages. 

Of  course  the  numbers  here  are  much  too  small,  and  you  would  be  able  to  cal¬ 
culate  e  easily.  If  n  were  a  600-digit  number,  then  you  would  no  longer  be  able  to 
do  so,  but  you  could  still  decode  the  encrypted  message  with  the  given  information 
in  a  reasonable  amount  of  time.  In  fact,  you  can  decode  the  message  very  quickly  if 
you  have  a  computer — basically  instantaneously,  if  your  computer  is  programmed 
to  do  so.  Sending  the  plain  text  can  usually  be  omitted,  because  the  probability  that  a 
randomly  generated  text  makes  sense  is  zero.  Thus,  if  the  decoded  text  makes  sense, 
the  signature  is  (for  all  practical  purposes)  correct. 


Chapter  29 
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Tips  on  Talking  about  Mathematics 

It’s  not  easy  to  talk  about  mathematics  to  other  people.  In  this  section,  we  present 

some  tips  that  we  find  helpful  when  we  present  a  talk  to  undergraduates. 

Let’s  say  someone  has  just  asked  you  to  give  a  talk  about  mathematics  to  under¬ 
graduates.  Here’s  what  you  need  to  do: 

•  Thank  them,  and  say  you’d  love  to.  Then  do  the  rest  of  the  things  below. 

•  Find  out  who  the  audience  is  and  what  they  know. 

•  Pick  an  interesting  topic.  Find  out  about  the  history  of  the  topic,  the  main  players 
in  the  field,  and  the  main  results. 

•  Now  that  you  have  your  topic,  you  need  to  write  the  talk.  Start  with  something 
everyone  is  interested  in.  This  could  be  the  history  of  what  you  plan  to  talk  about, 
or  it  could  be  an  interesting  related  result.  Then  motivate  the  question  you  are 
interested  in  looking  at,  build  up  the  talk,  and  remember  to  find  a  good  conclusion 
for  it. 

•  As  you  write  your  talk,  keep  the  level  of  the  audience  in  mind.  Do  not  use  terms 
that  your  audience  will  not  understand.  If  they  haven’t  heard  certain  words  you 
will  have  to  define  them,  which  brings  us  to  our  next  point:  the  more  terms  you 
have  to  define,  the  more  people  you  will  lose.  Pick  a  topic  that  doesn’t  require  a 
lot  of  introduction. 

•  You  need  to  decide  whether  you  will  use  transparencies,  the  computer,  or  the 
blackboard.  Each  has  its  advantages  and  disadvantages.  We’ll  run  through  each 
below. 

1.  Blackboard.  If  you  use  the  blackboard,  you’ll  most  likely  move  at  the  right 
speed  for  the  audience.  It’s  also  livelier  than  the  other  methods.  On  the  other 
hand,  you  should  absolutely  not  rely  on  your  notes.  Therefore,  if  you  give  a 
talk  using  the  blackboard,  you’ll  need  to  know  what  you  are  going  to  say  and 
when  you  are  going  to  say  it.  You’ll  need  to  watch  where  you  write  things, 
and  you  shouldn’t  erase  something  you  want  the  audience  to  look  at.  Make 
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sure  that  you  move  away  from  the  board  so  that  everyone  can  see  what  you 
wrote.  If  your  handwriting  is  illegible,  think  about  using  transparencies  or  the 
computer. 

2.  Transparencies.  Unless  you  are  very  careful,  you  will  probably  move  too 
quickly  for  the  audience.  You’ll  probably  also  stand  in  front  of  the  trans¬ 
parency  from  time  to  time,  blocking  the  audience’s  view.  If  you  are  aware 
of  these  potential  problems,  you  can  correct  them.  For  example,  you  can  use 
two  overheads.  You  should  not  write  too  much  on  one  transparency,  and  you 
should  always  be  aware  of  where  you  are  standing.  Find  out  how  big  the  room 
is,  and  make  sure  that  someone  in  the  back  of  the  room  will  be  able  to  see 
what  you  have  written.  The  advantage  of  transparencies  or  the  computer  is 
that  you’ll  have  all  your  diagrams  and  pictures  in  place,  and  you’ll  have  an 
outline  of  your  talk  with  you.  So  the  main  disadvantages  are  that  your  talk 
may  become  monotonous  and  that  it’s  possible  to  move  so  quickly  that  your 
audience  won’t  be  listening.  These  are  pretty  big  disadvantages. 

3.  Computer.  In  many  ways,  using  the  computer  to  give  your  talk  is  similar  to  us¬ 
ing  transparencies.  Many  of  the  advantages  and  disadvantages  are  the  same. 
Still,  there  are  a  few  things  that  we  should  mention.  For  mathematics,  many 
people  use  Beamer  from  LaTex.  (A  tutorial  can  be  found  at  [9].)  As  we  men¬ 
tioned,  some  things  are  the  same  as  for  transparencies.  The  basic  rule  is:  Don’t 
overdo  it.  Don’t  use  too  many  fonts,  too  many  pictures,  too  many  colors,  and 
don’t  put  too  much  on  one  screen.  However,  the  computer  presents  new  chal¬ 
lenges:  If  you  use  a  computer,  you  cannot  correct  things  in  front  of  an  audi¬ 
ence.  You’ll  need  to  proofread  your  slides  very  very  carefully.  It’s  also  pos¬ 
sible  to  use  a  tool  that  allows  you  to  stand  in  exactly  one  position,  moving 
only  your  thumb.  If  you  do  this  for  your  entire  talk,  it’s  fairly  likely  that  your 
talk — no  matter  how  well  you  prepare  it — will  be  dull.  Move!  Even  if  you 
don’t  have  to,  it’s  important  to  move  around.  If  you  must  use  a  laser  pointer, 
use  it  sparingly.  It’s  distracting  to  see  a  red  beam  moving  rapidly  around  as  you 
try  to  read  a  slide.  You  can  liven  up  the  talk  by  adding  relevant  photographs 
of  places,  manuscripts,  or  people.  You  might  even  add  a  video  clip.  Just  make 
sure  that  these  “attention  getters”  are  relevant  and  well  incorporated. 

4.  Blackboard,  Transparencies,  and  Computer.  One  thing  you  can  do  is  combine 
two  or  three  of  these  methods  of  presentation.  In  a  talk  for  undergraduates,  it’s 
nice  for  them  to  have  something  to  look  at  from  time  to  time,  other  than  the 
speaker. 

Pick  the  method  you  are  most  comfortable  with  and  that  you  like  the  best.  Then 
work  around  the  disadvantages. 

•  So  now  you  have  your  topic,  your  talk,  and  a  method  of  presentation.  You’re 
done,  right?  Um  ...  no.  You  still  have  to  present  the  talk.  Surprisingly,  the  hardest 
part  of  the  talk  is  timing.  We’ve  alluded  to  this  already  in  our  discussion  on 
transparencies,  but  there’s  more  to  be  said. 
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•  Find  out  how  long  the  talk  is.  If  it’s  twenty  minutes,  talk  for  twenty  minutes. 
(No  one  will  complain  if  it’s  eighteen  minutes,  and  everyone  will  complain  if  it’s 
thirty.)  There  is  only  one  way  to  know  how  long  your  talk  is:  practice  it. 

•  The  best  way  to  practice  a  talk  is  to  give  it  to  yourself  once.  Fix  the  things  you 
realize  need  fixing.  Then  try  to  find  an  audience  of  two  people,  one  who  knows 
what  you  are  talking  about  and  one  who  does  not.  Ask  them  if  you  can  present 
the  talk  to  them.  Listen  to  their  comments  and  use  them  to  improve  your  talk. 

•  Write  an  interesting,  but  truthful  abstract.  The  abstract  should  indicate  the  level 
of  the  talk. 

•  Before  you  give  your  talk,  ask  if  you  can  see  the  room  that  you  will  speak  in. 
Check  that  everything  you  need  is  there. 

•  Make  sure  that  everyone  in  the  room  can  hear  you  when  you  speak.  When  you 
give  the  talk,  look  at  the  audience.  They’ll  let  you  know  how  you  are  doing. 

There  are  other  articles  on  how  to  talk  about  mathematics  ([68],  [43]),  but  these 
are  primarily  aimed  at  graduate  students  or  professional  mathematicians.  Of  course, 
many  of  the  tips  are  the  same,  because  many  of  the  mistakes  people  make — whether 
talking  to  undergraduates,  graduate  students,  or  professors — are  the  same. 


29.1  Picture  Proofs 
Introduction 

You  have  probably  heard  the  saying  “a  picture  is  worth  a  thousand  words.”  The 
same  is  true  in  mathematics:  a  good  picture  can  help  a  reader  visualize  what  is 
happening,  it  can  aid  a  mathematician  in  finding  a  solution,  and  it  can  shed  light  on 
other  potential  results.  A  bad  picture,  on  the  other  hand,  can  be  deceiving.  Relying 
too  much  on  what  we  see  might  lead  us  to  incorrect  proofs,  which  in  turn  can  lead 
to  false  results.  This  project  should  help  convince  you  of  that. 

One  of  the  most  influential  theorems  in  mathematics  is  Pythagoras’  theorem.  It 
states  that  in  a  right  triangle  the  lengths  of  the  sides  of  the  triangle  satisfy  a2  +  b2  = 
c2,  where  c  denotes  the  length  of  the  hypotenuse,  and  a  and  b  denote  the  lengths  of 
the  other  two  sides  of  the  triangle.  There  are  many  known  proofs  of  this  theorem, 
some  of  them  based  on  clever  figures.  You  will  see  two  such  proofs  below. 


Prerequisites 

Basic  geometry  skills  plus  an  understanding  of  what  constitutes  a  rigorous  argument 
are  the  necessary  prerequisites  for  this  project.  We  suggest  that  you  read  through 
Chapter  5  before  attempting  this  project. 
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Guided  Project 

1 .  The  diagram  in  Figure  29. 1  suggests  a  proof  of  Pythagoras’  theorem.  To  make 
this  proof  rigorous,  however,  you  will  need  to  do  two  things;  you  need  to 
prove  something  about  the  diagram  and  you  need  to  do  an  algebraic  calcula¬ 
tion.  Do  both. 

2.  Give  a  second  proof  of  Pythagoras’  theorem  based  on  Figure  29.2.  This  one 
does  not  need  algebraic  calculations.  It  is  all  in  the  picture — or  is  it? 

3.  If  you  accepted  the  picture  of  Figure  29.2  as  a  complete  proof  of  Pythagoras’ 
theorem,  then  you  are  probably  willing  to  believe  that  Figure  29.3  provides 
a  proof  that  168  =  169.  After  all,  both  proofs  require  that  we  shift  the  pieces 
around  to  form  another  familiar  object.  What  is  wrong  with  this  proof? 

4.  Find  a  picture  to  illustrate  the  statement 


k=  l 


Does  your  picture  amount  to  a  rigorous  proof?  What  are  its  strengths  and 
what  are  its  weaknesses? 

5.  Use  the  picture  of  Figure  29.4  to  prove  that 


(This  problem  was  proposed  by  Michael  Vowe  in  [105].  Figure  29.4  is  part 
of  the  solution  given  by  Gerhard  Wanner  in  [106].) 


a 


Fig.  29.1  a2+b2  =  c2 


b 
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Fig.  29.2  a2  +  b2=c2 


Open-Ended  Project 

Try  to  find  some  other  picture  proofs.  (We  suggest  you  think  back  to  your  geometry 
course.)  Can  you  also  come  up  with  a  (somewhat)  convincing  picture  proof  of  a 
false  statement? 


Notes  and  Sources 

We  first  learned  of  the  false  proof  presented  in  part  3  from  our  colleague,  G.  Adams. 
For  a  connection  between  this  problem  and  Fibonacci  sequences  see  the  article  [54], 
where  the  author  indicates  that  this  “not  a  picture  proof”  can  be  traced  back  to  the 
year  1868.  There  are  two  excellent  books  on  picture  proofs  by  R.  B.  Nelsen,  [75] 
and  [76].  The  website  by  A.  Bogomolny  [13]  contains  84  proofs  of  Pythagoras’ 
theorem,  many  of  them  with  pictures,  and  some  of  them  with  applets. 
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Fig.  29.4  Jo1 2 3 4  fa  =  f 

29.2  The  Best  Number  of  All  (and  Some  Other  Pretty  Good 
Ones) 

Introduction 

We  know  that  positive  integers  can  be  even  or  odd,  they  can  be  prime  or  composite, 
and  they  can  be  triangular  or  square.  (Each  of  these  terms,  with  the  exception  of 
“square,”  appears  either  below  or  in  the  index  of  this  book,  if  you  have  forgotten  the 
definition.)  In  this  project  we’ll  look  at  some  more  things  that  they  may  be:  the  sum 
of  their  proper  divisors,  the  product  of  their  proper  divisors,  or  both. 


Prerequisites 

This  exercise  requires  an  understanding  of  proof  techniques  (Chapter  5). 


Guided  Project 

Recall  that  an  integer  greater  than  1  is  prime  if  its  only  positive  divisors  are  1  and 
itself.  A  positive  integer  greater  than  1  that  is  not  prime  is  called  composite.  By  a 
proper  divisor,  we  mean  a  positive  divisor  that  is  not  equal  to  the  integer  itself.  A 
positive  integer  is  said  to  be  perfect  if  it  is  the  sum  of  its  proper  divisors. 

1 .  Show  that  6  is  a  perfect  number. 

2.  Show  that  6  is  the  only  perfect  number  less  than  10. 

3.  Find  another  perfect  number  that  is  less  than  30. 

By  now  you  should  have  found  the  first  two  perfect  numbers.  The  next  is  496. 

4.  Check  that  496  is  a  perfect  number. 
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5.  Find  five  positive  integers,  each  one  being  the  product  of  all  of  its  proper 
divisors. 

6.  Characterize  all  positive  integers  that  are  the  product  of  their  proper  divisors. 

Now  you  are  almost  ready  to  prove  the  main  theorem  in  this  project.  Steps  7-9 
below  will  lead  you  through  the  proof. 

Theorem  29.1.  There  is  only  one  positive  integer  that  is  both  the  product  and  sum 
of  all  its  proper  positive  divisors,  and  that  number  is  6. 

7.  Let  p  be  a  prime.  Prove  that  p3  is  not  perfect. 

8.  Prove  as  many  of  the  following  as  you  need  to,  until  you  see  the  proof  of  the 
theorem. 

(a)  Prove  that  the  only  even  number  that  is  both  the  product  and  sum  of  all 
its  proper  positive  divisors  is  6. 

(b)  Prove  that  the  only  multiple  of  3  that  is  both  the  product  and  sum  of  all 
its  proper  positive  divisors  is  6. 

(c)  Prove  that  there  is  no  multiple  of  5  with  this  property. 

(d)  Prove  that  there  is  no  multiple  of  7  with  this  property. 

9.  Prove  the  theorem. 

The  goal  of  the  next  part  of  this  project  is  to  solve  the  following: 

Problem.  Let  n  be  an  odd  positive  integer.  Find  all  perfect  numbers  of  the  form 
n”  +  1. 

It  is  also  possible  to  find  all  perfect  numbers  of  the  form  nn  +  1  when  n  is  an  even 
integer — but  this  is  more  difficult.  The  case  n  odd  is  already  quite  difficult,  so  we 
will  help  you  by  outlining  steps  you  may  follow  to  solve  the  problem.  We  also 
suggest  that  you  use  the  following  theorem,  which  is  due  to  Euler.  (See,  for  example, 
[22,  pp.  10-1 1]  for  a  proof  of  this  theorem.) 

Theorem  29.2  (Euler).  If  N  is  an  even  perfect  number,  then  N  =  2k~  [  (2k  -  1), 
where  2k  —  1  is  prime. 

10.  Find  an  integer  a  such  that  nn  +  1  =  (: n  +  1  )a.  (The  integer  a  will  be  a  sum 
and  an  expression  that  involves  n.) 

1 1 .  Prove  that  n+  1  and  a  have  no  positive  common  divisor  other  than  1 .  (We  say 
that  n  +  1  and  a  are  relatively  prime.) 

12.  Assuming  that  n  is  odd  and  «"  +  1  is  perfect,  use  Euler’s  theorem  to  show  that 
n  +  1  =  2k  1  for  some  integer  k. 

13.  Solve  the  problem. 

This  was  posed  as  a  problem  in  the  American  Mathematical  Monthly  [72]. 
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Open-Ended  Project 

Let  p  be  a  polynomial  over  Z  or  K  (see  Problem  10.13).  What  might  it  mean  to  say 
that  a  polynomial  is  prime?  composite?  perfect?  square?  triangular?  While  some  of 
these  might  make  sense,  others  may  not.  Once  you  have  defined  the  terms  that  make 
sense,  are  there  some  interesting  theorems  you  can  prove  about  them? 


Notes  and  Sources 

Euclid,  in  his  Elements,  IX. 36,  gave  the  result  that  if  p  =  2k  —  1  is  a  prime,  then 
2k  1  p  is  perfect.  For  k  =  2,3,5,  and  7  we  note  that  2*  —  1  is  prime.  Thus  we  get  four 
perfect  numbers,  6,  28,  496,  and  8128.  The  eighth  perfect  number  is  already  quite 
large:  2,305,843,008,139,952,128.  A  good  source  to  begin  learning  more  about 
numbers  is  the  book  [20].  The  problem  we  discussed  in  the  guided  portion  of  this 
project  appears  in  [15],  which  we  recommend  to  those  wanting  to  know  more  about 
number  theory. 


29.3  Set  Constructions 
Introduction 

It  is  amazing  how  much  one  is  able  to  build  out  of  almost  nothing — and  by  almost 
nothing  here  we  mean  the  empty  set.  The  guided  project  will  lead  you  through  a 
construction  of  the  natural  numbers. 


Prerequisites 

While  the  set  theory  introduced  in  Chapters  6-9  is  sufficient,  you  may  find  it  helpful 
to  have  an  understanding  of  mathematical  induction  (Chapter  18),  which  is  also 
introduced  in  this  project. 


Guided  Project 

Let  x  be  a  set.  Define  the  successor  of  x  to  be  the  set  x+  =  xU  {x}. 

1.  Determine  the  successors  and  the  successors  of  the  successors  of  the  sets 
0,  {0},  and  {a,b,c}. 
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We  now  introduce  the  following  notation.  Let  0  =  0,  1  =0+,2  =  1+,  and  so  on. 

2.  Write  down  0,1,2, 3,  and  4  as  sets  in  two  different  ways;  first  using  the  def¬ 
initions  made  above,  and  then  using  only  the  symbol  0,  set  brackets,  and 
appropriate  set  notation. 

It  may  seem  intuitively  obvious  that  if  we  “do  this  forever,”  then  we  will  have  de¬ 
fined  the  natural  numbers.  However,  as  simple  and  as  attractive  as  this  approach  may 
be,  it  is  not  what  we  call  mathematically  rigorous.  What  we  need  is  a  statement  that 
explains  that  we  can  do  this  forever.  We  will  take  this  statement  as  an  axiom,  and 
thus  it  will  not  be  proved. 

Axiom  29.3  (Axiom  of  infinity).  There  exists  a  set  containing  0  and  containing  the 
successor  of  each  of  its  elements. 

3.  Let  I  be  a  nonempty  set  and  {A^  :  k  £  1}  be  an  indexed  collection  of  sets. 
Suppose  that  for  each  k  £  I,  the  set  A *  has  the  two  properties  (i)  0  C  A  /.  and 
(ii)  if  x  £  Ak,  then  x+  £  A £.  Show  that  the  set  Die iAk  also  has  these  two 
properties.  We  will  call  a  set  with  these  two  properties  a  successor  set. 

4.  The  axiom  of  infinity  guarantees  the  existence  of  a  successor  set.  So,  let  A 
be  an  arbitrary  successor  set.  Define  the  set  (Oa  to  be  the  intersection  of  all 
subsets  of  A  that  are  also  successor  sets.  In  symbols  we  might  write 

(Oa  =  n 

Bei 

where  I  =  {B  :  B  C  A,  and  B  is  a  successor  set}. 

By  part  3,  (Oa  is  a  successor  set.  Show  that  (Oa  =  (Ob  for  all  successor  sets 
A  and  B.  Perhaps  surprisingly,  our  definition  does  not  depend  on  our  initial 
choice  of  successor  set,  and  therefore  we  will  write  co  rather  than  (Oa  ■ 

We  call  co  the  set  of  natural  numbers.  Thus  far  we  know  that  ©  is  a  successor 
set,  and  it  is  the  only  successor  set  that  is  contained  in  every  other  successor 
set. 

5.  Prove  the  following  statement.  Suppose  S  C  co  satisfies  the  two  properties 

(i)  0  £  S,  and 

(ii)  if  x  £  S,  then  x+  £  S. 

Show  that  S  =  (0.  (This  is  called  the  principle  of  mathematical  induction  and 
is  discussed  in  Chapter  18.) 

6.  Prove  that  x+  ^  0  for  all  x  £  ft). 

7.  Consider  the  set  S  =  {x  £  co  :  Vy  £  ft),  if  y  £  x,  then  ycx}.  Use  part  5  to  show 
that  S  =  co.  Conclude  that  for  all  u  and  v  in  ft),  if  u  £  v,  then  u  C  y. 

8.  Use  part  7  to  prove  that  if  x  and  y  are  in  co  and  x+  =  y+,  then  x  =  y. 

The  two  defining  properties  ((i)  and  (ii))  of  a  successor  set,  the  principle  of  math¬ 
ematical  induction,  and  parts  6  and  8  of  this  project  are  known  as  the  five  Peano 
axioms.  They  are  the  pillars  of  the  construction  of  the  natural  numbers. 
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Open-Ended  Project 

We  have  created  new  sets  from  old  ones  using  element  relations,  union,  intersection, 
power  sets,  and  Cartesian  products.  Use  some  (or  all)  of  these  to  create  new  sets 
from  the  empty  set.  Do  your  new  sets  have  some  interesting  properties? 


Notes  and  Sources 

This  project  is  guided  by  Chapters  1 1  and  12  of  P.  Halmos’  Naive  Set  Theory  [41]. 
For  a  brief  presentation  of  the  Peano  axioms  and  some  other  attempts  to  give  the 
natural  numbers  a  solid  foundation  see  [59,  pp.  987-989]. 


29.4  Rational  and  Irrational  Numbers 
Introduction 

We  know  that  when  we  add  two  rational  numbers,  the  result  is  a  rational  number. 
For  this  reason,  we  say  that  the  rationals  are  closed  under  addition.  Similarly,  when 
we  multiply  two  rational  numbers,  the  result  is  rational.  Thus,  the  rationals  are  also 
closed  under  multiplication.  In  this  project,  you  will  investigate  the  behavior  of  the 
rationals  and  irrationals  under  other  operations. 


Prerequisites 

This  project  relies  on  proofs  in  cases  (Chapter  5),  as  well  as  familiarity  with  ratio¬ 
nal  and  irrational  numbers.  In  particular,  you  will  need  to  use  the  fact  that  s/7,  is 
irrational. 


Guided  Project 

Let  a  and  b  be  two  irrational  numbers. 

1 .  Give  an  example  of  two  irrational  numbers  a  and  b  such  that  a  +  b  is  irrational. 

2.  Give  an  example  of  two  irrational  numbers  a  and  b  such  that  a  +  b  is  rational. 

So  the  irrational  numbers  are  not  closed  under  addition  and  certainly  are  less  well 
behaved  than  the  rational  numbers.  Now  consider  two  real  numbers,  a  and  b. 
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3.  Give  an  example  of  two  rational  numbers  a  and  b  such  that  ab  is  rational. 

4.  Give  an  example  of  two  rational  numbers  a  and  b  such  that  ab  is  irrational. 

Here’s  a  charming  little  proof,  based  entirely  upon  things  that  you  have  proved 
in  this  course,  that  an  irrational  number  raised  to  an  irrational  power  can  be  rational. 

5.  Consider  the  following. 

Theorem  29.4.  There  exist  irrational  numbers  a  and  b  such  that  ab  is  ratio¬ 
nal. 

Complete  the  proof  of  this  theorem,  using  appropriate  choices  for  a  and  b  and 
the  two  cases  below: 

Case  1 .  \J2  “  is  a  rational  number; 

Case  2.  \J2^ ~  is  an  irrational  number. 

The  interesting  thing  about  your  proof  of  Theorem  29.4  is  that  you  don’t  need  to 

yj2 

know  whether  \[2  “  is  rational  or  irrational! 

6.  There  are  many  other  examples  of  irrational  powers  of  irrational  numbers  that 
are  rationals,  assuming  you  know  lots  of  different  ways  to  express  irrational 
numbers.  See  if  you  can  come  up  with  another  example  based  on  the  fact 
that  the  natural  logarithm  of  2,  denoted  In  2,  is  irrational.  Can  you  find  other 
examples? 

Knowing  that  an  irrational  number  to  an  irrational  power  may  be  rational  raises 
the  question  of  whether  an  irrational  to  an  irrational  can  be  irrational.  Again,  we 
are  looking  for  a  proof  that  does  not  use  anything  more  than  what  we  stated  in  the 
prerequisites.  There  are  some  nonelementary  proofs  of  this,  but  an  elementary  proof 
exists  as  well  [56]. 

7.  Prove  the  following  theorem. 

Theorem  29.5.  There  exist  irrational  numbers  a  and  b  such  that  ab  is  irra¬ 
tional. 

We  suggest  that  you  consider  using  a  proof  in  cases,  with  \/2  ~  for  one  of 
your  cases. 


Open-Ended  Project 

Study  the  behavior  of  the  rationals  and  irrationals  under  different  operations.  Your 
investigations  might  deal  with  specific  numbers,  or  with  the  rationals  and  irrationals 
in  general.  For  example,  is  \/2+V3  irrational?  In  another  direction,  can  you  define 
an  operation,  0,  such  that  a  ©  b  is  irrational  for  all  irrational  a  and  hi  Think  of  other 
questions  along  these  lines  and  try  to  answer  them. 
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Notes  and  Sources 

The  connection  of  this  problem  to  Hilbert’s  seventh  problem  is  discussed  in  the 
Spotlight:  Hilbert’s  Seventh  Problem  at  the  end  of  this  chapter.  The  proof  that  an 
irrational  number  to  an  irrational  power  can  be  irrational  appears  in  [56].  These 
authors  attribute  the  proof  of  Theorem  29.4  to  D.  Jarden,  [55].  This  problem  appears 
as  a  “fun  fact”  on  the  Web  at  [104]. 


29.5  Irrationality  of  e  and  n 
Introduction 

The  problems  in  this  project  require  knowledge  of  calculus.  More  specifically,  you 
need  to  know  what  the  number  e  is,  what  a  geometric  series  is,  and  what  the  series 
expansion  for  e  is.  If  you  have  seen  all  this,  then  you  probably  have  also  been  told 
that  e  is  an  irrational  number.  The  first  task  of  this  project  is  to  work  through  Ivan 
Niven’s  proof  of  this  fact.  If  you  have  never  seen  the  proof,  it’s  a  nice  application  of 
series.  Everything  you  need  to  prove  that  e  is  irrational  is  provided  in  this  project. 

The  proof  that  71  is  irrational,  outlined  in  this  project,  is  also  due  to  I.  Niven. 
In  his  words,  “In  the  June  1947  issue  of  the  Bulletin  of  the  A.  M.  S.,  I  gave  a  one 
page  proof  that  n  is  irrational.  I  had  worked  on  this  problem  for  a  specific  reason:  in 
the  first  edition  (1938)  of  what  is  now  a  great  classic.  Introduction  to  the  Theory  of 
Numbers,  by  G.  H.  Hardy  and  E.  M.  Wright,  the  authors  made  the  observation  that 
‘There  is  no  simple  proof  of  the  irrationality  of  71.'  I  wondered  why  this  should  be 
so.”  (See  [3]  for  the  full  text  of  Niven’s  conversation.) 

We  have  provided  you  with  all  the  steps  you  need  to  re-create  Niven’s  one-page 
proof. 


Prerequisites 

Since  the  proofs  are  by  contradiction,  you  will  need  to  have  covered  Chapter  5.  This 
project  also  assumes  that  you  have  a  basic  understanding  of  infinite  series. 

For  the  proof  that  these  numbers  are  irrational,  you  will  need  to  recall  three  re¬ 
sults  from  your  calculus  course.  The  first  is  that,  for  —  1  <  r  <  1,  the  geometric  series 
satisfies 

fv  =  1 

72=0 


1  —  r 


The  second  fact  is  that  the  series  expansion  for  ex  is 


'  =  1  Tx/1 !  +x2/2!  +x3/3!  +  •  •  •  +//£:!  +  • 
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The  last  result  that  you  will  need  is  the  product  rule  for  differentiation. 


Guided  Project 


1 .  Prove  the  following  theorem,  using  the  steps  outlined  below. 
Theorem  29.6.  The  number  e  is  irrational. 

Step  1.  Let  k  £  Z+.  Show  that 

1  1  1 

(fc+1)  (fc+l)(&  +  2)  (k  +  1)  (k  +  2)  (k  +  3) 

1  ,  i  ,  i 

“  (£+l)  +  (£+l)2  +  (£+l)3+'"' 


Step  2.  Prove  that  if  k  is  an  integer  with  k>2,  then 

1  1  _ 1 _ 

(fc+1)  (k+  \){k  +  2)  (k  +  1 )  (k  +  2)  [k  +  3) 


Step  3.  Suppose  to  the  contrary  that  e  is  rational.  Prove  that  this  implies  there 
exists  an  integer  k  such  that  k\e  is  an  integer. 

Step  4.  Using  the  series  expansion  for  e,  show  that  k\e  is  never  an  integer. 
This  step  should  complete  the  contradiction. 

2.  Prove  the  following  theorem  by  contradiction,  using  the  steps  below. 

Theorem  29.7.  The  number  n  is  irrational. 

Suppose  to  the  contrary  that  there  are  positive  integers  a  and  b  such  that 
K  =  a/b.  We  will  write  f<mi  for  the  rath  derivative  of  /. 

For  n  £  Z+,  define  the  two  polynomials  /„  and  F„  by 


fn(x) 


xn{a-bx)n 

n\ 


and 


Fn(x)  =  f„(x)  ~fn2\x)  +  fn\x) - b  (-1 )"  f^"]  (x) . 

We  will  determine  a  value  for  n  in  the  fifth  step  below.  Until  then,  assume  that 
n  is  a  positive  integer. 

Step  1 .  Show  that  for  every  positive  integer  j,  all  of  the  following  are  integers: 

fn(0),  f„{n)  =  fn{a/b),  /„W (0),  and  fil] {k)  =  f,[J\a/b). 

Step  2.  Prove  that  f„(x)  sin x  =  j^(F^(x)smx  —  Fn(x)cosx). 

Step  3.  Prove  that  Jq  f„(x)  sinxdx  =  F„( ri)  +Fn(0). 

Step  4.  Find  the  maximum  of  the  function  /„  on  the  interval  [0, 7t].  (Note  that 
the  maximum  depends  on  n.) 

Step  5.  Prove  that  for  n  sufficiently  large,  Jq  fn(x)  sinxdx  is  not  an  integer. 
This  step  should  complete  the  contradiction. 
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Open-Ended  Project 

Can  you  prove  that  e 2  is  irrational?  What  else  can  you  prove  is  irrational? 


Notes  and  Sources 

In  1737,  Euler  showed  that  e  is  irrational.  Johann  Heinrich  Lambert  showed,  in 
1761,  that  K  is  irrational.  The  number  n  has  a  very  interesting  history.  For  a  brief 
history  of  71,  see  [26,  p.  100],  For  a  fuller  account,  up  to  about  1971,  see  [11].  For 
more  recent  developments,  see  [7]. 

The  one -page  proof  in  the  Bulletin  of  the  A.M.S.  that  Niven  refers  to  in  the  quote 
above  can  be  found  in  [77].  The  reference  for  Hardy  and  Wright’s  text  is  given  in 
[46].  The  conversation  with  Niven  appears  in  [3]. 


29.6  A  Complex  Project 
Introduction 

In  this  project,  we  investigate  the  complex  numbers.  In  order  to  say  something  in¬ 
teresting,  we  will  need  to  assume  several  facts — some  elementary  and  some  not. 


Prerequisites 

You  will  need  to  use  the  material  on  relations  in  Chapter  10  and  order  in  Chap¬ 
ters  12  and  13.  In  particular,  you  must  be  familiar  with  the  order  definitions:  Def¬ 
initions  13.1  and  13.2.  We  will  introduce  complex  numbers  here,  and  we  will  not 
assume  that  you  have  already  seen  them.  This  project  also  assumes  a  basic  famil¬ 
iarity  with  series.  If  you  have  taken  a  calculus  course  that  covered  infinite  series  of 
functions  including  ex,  sinjr,  and  cosx,  you  will  have  the  necessary  background  in 
series. 


Guided  Project 

Let’s  begin  by  recalling  the  complex  numbers, 

C  =  {z  :  z  =  a  +  bi,  where  a,iel  and  i2  =  —  1 } . 
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Complex  numbers  can  be  thought  of  in  several  ways.  When  considered  as  above, 
it’s  easy  to  see  how  to  multiply  and  add  them:  For  z,  w  £  C  write  z  =  a  +  bi  and 
w  =  c  +  di,  where  a,b,c,d  €  M.  Then 

z  +  w  =  (a  +  bi)  +  (c  +  di)  =  (a  +  c)  +  (b  +  d)i 


and 

z-w=  (a  +  b\)(c  +  d  i)  =  ( ac  —  bd)  +  (ad  +  be)  i. 

Using  these  definitions  and  the  properties  stated  in  the  Appendix  on  page  363 
(Algebraic  Properties  of  R),  answer  the  following  questions: 

1.  Which  of  the  properties  A1-A4,  M1-M4,  and  Dl  in  the  Appendix  on  page  363 
apply  to  the  complex  numbers  with  multiplication  and  division  defined  as 
above? 

There  is  another  way  to  think  about  complex  numbers  that  is  quite  helpful  for  com¬ 
putations.  For  this  notation,  you  should  think  in  polar  coordinates.  For  z  =  0,  for 
example,  we  can  write 


0  =  O  +  Oi  =  0(cos  9  +  isin0), 

for  any  choice  of  a  real  number  0.  For  z^  0,  we  may  write 

z  =  a  +  b\  =  r(cos0  +  isin0)  (29.1) 

where,  as  you  learned  when  you  discussed  polar  coordinates,  9  £  R  is  the  measure  of 
the  angle  from  the  polar  axis  to  the  line  joining  the  origin  to  the  point  ( a,b ).  When 
9  >  0,  the  measure  is  taken  in  the  counterclockwise  direction  (as  in  Figure  29.5) 
and  when  9  <  0  it  is  taken  in  the  clockwise  direction.  The  nonnegative  real  number 
r  =  (a2  +  b2) 1//2  represents  the  distance  of  z  to  the  origin.  Of  course,  there  are  in¬ 
finitely  many  choices  for  0,  for  once  one  choice  works  9  +  2nn  will  work  for  every 
integer  n. 

If  you  haven’t  done  this  before,  you  should  practice  writing  several  numbers  in 
both  notations.  You  can  check  that  you  have  the  correct  answer  using  a  calculator  or 
computer. 

Euler’s  formula  provides  another  way  to  write  complex  numbers.  Euler’s  formula 
says  that  for  any  real  number  t. 


elt  =  cost  +  i  sin  t.  (29.2) 

As  a  consequence  of  equations  29.1  and  29.2,  we  can  write  every  complex  num¬ 
ber  in  the  form  re10,  where  9  and  r  are  real  numbers  and  r>  0. 

We’ll  ask  you  to  provide  a  justification  for  Euler’s  formula  in  a  moment.  Before 
proceeding,  we  provide  two  exercises  to  convince  you  that  multiplication  and  raising 
complex  numbers  to  high  powers  is  much  more  pleasant  if  you  use  Euler’s  formula. 
What  we  mean  is  the  following:  Consider  z  =  reia  and  w  =  se'^ .  Then 
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Fig.  29.5  The  complex  number  a  +  bi  =  re'6  with  0  >  0 


zw  =  (reia){seil3)  =  rse^a+p\  (29.3) 

2.  Multiply  e17t/4  and  2e17t/3  by  writing  the  first  as  a  +  bi,  the  second  as  c  +  c/i, 
and  then  performing  the  multiplication.  Then  multiply  the  two  numbers  using 
equation  (29.3).  Finally,  show  that  the  two  answers  are  equal.  (For  the  final 
part  of  this  problem,  you  may  want  to  use  the  formulas  for  sin(x  +  y)  and 
cos(x  +  y).) 

3.  Compute  (1  +  i ) 1 0000  without  a  calculator.  You  should  write  the  answer  in  the 
form  s"(a  +  hi),  where  s  >  0. 

We  are  not  interested  in  a  rigorous  proof  of  Euler’s  formula  here,  but  we  hope  to 
convince  you  of  its  validity  using  facts  about  series  from  your  calculus  classes.  So 
recall  that  for  all  real  x  the  following  series  converge  as  indicated: 


e 


X 


cosx  = 


i 

n= 0 


(-l)”x2" 
(2  n)\ 


and  sinx  = 


~  (-l)”x2n+1 

h)  (2n+l)! 


4.  Ignoring  issues  of  convergence  in  the  proofs  below,  assuming  that  you  may 
replace  x  by  if  in  the  equations  above,  and  rearranging  terms  in  the  series  if 
need  be,  show  that  you  can  obtain  Euler’s  formula:  elt  =  cos  t  +  i sinf . 

In  particular,  when  t  =  n  we  have 

ein  =  cos7T  +  isin7r, 


or 

ei7t+l  =0, 

which  is  sometimes  called  Euler’s  equation. 

5.  Show  that  el7C/2  =  i.  Using  this  fact  and  assuming  that  exponentiation  works 
as  usual,  show  that  i1  is  real.  Once  you  have  completed  this  part  of  the  project, 
please  read  the  notes  (under  Notes  and  Sources)  provided  below. 

6.  Complex  numbers  are,  actually,  more  complex  than  real  numbers.  Our  brief 
discussion  of  exponentiation  has  already  pointed  out  one  potential  problem. 
There  are  some  very  famous  “false  proofs”  involving  complex  numbers.  Here 
is  a  proof  that  1  =  —  1.  See  if  you  can  figure  out  where  the  error  is. 


29.6  A  Complex  Project 
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Not  a  proof.  Since  (—  1 ) / 1  =  1  /(—  1 )  we  may  take  square  roots  to  obtain  the  equa¬ 
tion  ^/— 1/1  =  \j'\  /  —  1 .  Thus  \J  —  1/vT  =  \/T/ V  T.  So  i  =  1/i  and  multiplying 
the  equation  by  i,  we  get  —1  =  1.  H 

We  probably  all  can  agree  that  something  is  wrong.  Find  the  mistake  and  write  a 

concise  explanation  of  what  the  error  is. 

In  the  final  part  of  this  guided  project,  we  discuss  order  on  the  complex  numbers. 
In  Chapter  12  we  studied  some  properties  of  the  ordered  sets  R  and  N.  For  example, 
we  introduced  the  well-ordering  principle  of  N.  Sets,  including  N  and  R,  may  be  or¬ 
dered  in  many  ways.  In  this  project,  we  will  discuss  a  specific  order  on  the  complex 
numbers.  At  this  point,  you  will  need  to  recall  Definitions  13.1  and  13.2. 

We  can  define  an  order  on  the  complex  numbers  as  follows:  If  z  =  a  +  bi  and 
w  =  c  +  di,  we  will  say  that 

Z  A  w  if  a  <  c  or  we  have  a  =  c  and  b  <  d. 

Note  that  if  the  numbers  z  and  w  are  real,  then  z  =  a  +  Oi  and  w  =  c  +  Oi  and  this 

order  reduces  to  the  usual  one  (<)  on  R.  This  order  is  called  the  lexicographical 

order  on  C  because  the  order  is  the  same  as  the  one  used  to  order  the  words  in  a 
dictionary. 

7.  Show  that  the  lexicographical  order  on  C  is  a  partial  order. 

8.  Show  that  the  lexicographical  order  on  C  is  a  total  order. 

9.  One  important  property  of  R  is  the  following:  If  a.h.  and  c  are  real  numbers 
with  a  <b  and  0  <  c,  then  ac  <  be.  We  now  investigate  this  property  in  C. 
We  have  seen  that  the  lexicographical  order  on  C  reduces  to  the  usual  less 
than  or  equal  to  relation  on  R.  Writing  u  A  v  for  u  A  v  and  u  /  v,  does  the 
lexicographical  order  preserve  the  property  above?  That  is,  if  z  A  w  and  0  Am, 
is  it  always  the  case  that  zu  A  wul  Either  prove  this  or  give  a  counterexample. 


Open-Ended  Project 

This  part  of  the  project  is  an  exercise  in  searching  journals  and/or  the  Web.  There 
are  at  least  two  other  “proofs”  of  Euler’s  formula.  Do  a  search  until  you  find  the  one 
you  feel  is  simplest.  You  should  have  at  least  three  proofs  (including  the  one  given 
here).  Explain  the  one  you  have  decided  is  simplest,  providing  as  much  detail  as  you 
can.  Summarize  the  other  two  and  explain  why  you  believe  the  proof  you  chose  is 
the  “best.” 
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Notes  and  Sources 


In  your  proof  that  i1  is  real,  we  told  you  to  use  the  fact  that  e'n/2  =  i.  It  is  also  true 
that  el57c/2  =  i  and  had  we  suggested  you  use  e15”/2  instead,  you  would  have  obtained 
a  different  value  of  i1.  That’s  because  exponentiation  of  complex  numbers  is  more 
complicated  than  that  of  real  numbers  and  it  involves  a  discussion  of  “multivalued 
functions”  that  we  will  not  address  here.  Instead,  we  recommend  [95]  and  [74].  We 
do  note,  however,  that  this  is  like  a  complex  version  of  Theorem  29.4,  which  showed 
that  there  exist  irrational  numbers  a  and  b  such  that  ab  is  rational. 

The  equation  e17C  +  1  =  0  is  considered  one  of  the  most  beautiful  equations  of 
all  time.  One  reason  for  this  is  that  it  provides  a  relationship  between  e,  i,  n,  1, 
and  0,  which  are  considered  five  of  the  most  important  constants  in  mathematics.  In 
fact,  readers  of  Physics  World  were  asked  in  2004  to  vote  for  their  favorite  equation 
and  this  one  came  in  first,  beating  out  other  obvious  contenders  such  as  E  =  me2. 
(Admittedly,  it  tied  with  Maxwell’s  equations  of  electromagnetism.)  Nahin,  [73], 
wrote  a  whole  book  about  what  he  called  “Dr.  Euler’s  Fabulous  Formula.” 

You  might  wish  to  conduct  a  survey  of  your  mathematical  friends  and  teachers  to 
see  what  they  think  is  the  most  beautiful  equation.  It  will  almost  certainly  be  Euler’s 
equation! 


29.7  When  Does  f~x  =  1//? 

Introduction 

Students  often  confuse  the  inverse  of  /,  denoted  /  ,  with  the  multiplicative  inverse 

of  /,  denoted  1//.  When  are  these  two  equal?  Surprisingly,  although  the  mistake 
of  assuming  /~x  =  1//  is  common,  functions  that  have  this  seemingly  intuitive 
property  are  not  common  at  all. 


Prerequisites 

This  project  requires  an  understanding  of  functions  and  their  inverses,  presented  in 
Chapters  14-16. 


Guided  Project 

In  what  follows,  /  :  X  — >  Y  will  always  denote  a  bijective  function  between  two 
subsets,  X  and  Y,  of  R  satisfying  /-1  =  1//. 


29.8  Pascal’s  Triangle 
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1 .  What  can  you  say  about  the  domain  and  range  of  such  a  function? 

2.  Find  an  example  of  such  a  function,  where  the  domain  of  /  consists  of  a  single 


point. 

3.  Find  an  example  of  such  a  function  on  a  domain  consisting  of  two  points. 

4.  Can  such  a  function  /  exist  on  the  integers?  Why  or  why  not? 

5.  Show  that  (/ o  /)(x)  =  1/x  and  /(l//(x))  =  x  for  all  x  €  X. 

6.  Show  that  /(I /x)  =  1  //(x)  for  all  x  €  X. 

7.  Define  a  function  g  :  (R  \  {0})  — >  (R  \  {0})  by 


Show  that  g  satisfies  g  1  =  l/g  on  its  domain  R \  {0}. 

8.  Can  you  find  other  examples  of  such  functions? 


Open-Ended  Project 

We  mention  here  some  other  common  errors  that  occur  with  functions  /  :  X  — >  R, 
where  XCK.  Students  often  confuse  the  composition  fof  with  the  product  /  •  /, 
where  (/  •  /)(x)  =  f(x)  ■  /(x)  for  all  x  £  X.  What  can  you  say  about  a  function  / 
that  satisfies  /©/  =  /•/? 

Yet  another  problem  arises  with  powers.  Which  functions  /  :  X  — >  R.  have  the 
property  that  /(x2)  =  (/(x))2  for  all  x£X? 


Notes  and  Sources 

This  project  is  based  upon  two  interesting  articles.  The  first  article,  [6],  has  sev¬ 
eral  other  interesting  questions  and  problems  for  students.  Some  of  them  require 
knowledge  of  continuous  functions.  The  second  article,  [18],  is  rather  advanced, 
and  it  presents  much  more  than  we  have  here.  It  includes  a  look  at  complex-valued 
functions. 


29.8  Pascal’s  Triangle 
Introduction 

In  this  project  you  will  explore  an  arithmetical  triangle  that  was  the  object  of  study 
by  Blaise  Pascal  in  a  treatise  he  wrote  in  1654  (though  it  was  known  to  mathemati- 
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cians  before  him).  He  used  this  triangle  to  solve  a  question  posed  to  him  about  gam¬ 
bling.  You  can  find  out  more  about  the  history  of  this  problem  from  the  references 
at  the  end  of  the  project. 


Prerequisites 

This  project  is  appropriate  after  Chapter  18  on  induction  has  been  covered.  You 
should  read  over  Problem  18.25  before  you  begin. 


Guided  Project 

Pascal’s  triangle  is  presented  below.  Each  line  has  one  more  entry  than  the  previous 
line.  All  entries  along  the  left  and  right  edges  of  the  triangle  are  one.  Every  other  en¬ 
try  in  a  line  is  the  sum  of  the  two  numbers  on  the  line  above  that  lie  to  the  immediate 
left  and  right.  The  triangle  is  unbounded  below. 

1 

1  1 
1  2  1 

13  3  1 

1  4  6  4  1 

1  5  10  10  5  1 


Recall  that  we  defined  n  factorial  and  the  binomial  coefficient  (jj  in  the  problems 
in  Chapter  18. 

The  first  few  exercises  should  help  familiarize  you  with  Pascal’s  triangle. 

1.  Compute  each  of  the  following: 


2.  Solve  Problem  18.25  (c)  if  you  haven’t  already.  In  other  words,  prove  that  for 
all  k,  n  €  N  with  1  <  k  <  n,  we  get 


3.  Use  the  definition  of  Pascal’s  triangle  given  above  to  show  that  all  entries  in 
Pascal’s  triangle  are  binomial  coefficients  and  find  a  familiar  mathematical 
expression  for  the  kth  entry  from  the  left  in  the  nth  row.  (The  first  entry  from 
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the  left  is  entry  0  and  the  first  row  is  row  0.)  Use  induction  to  prove  that  your 
familiar  expression  is  correct. 

4.  For  each  n  £  N,  consider  the  statement 


(a)  Check  this  formula  for  a  few  small  values  of  n. 

(b)  Prove  the  statement.  (You  may  use  Theorem  18.10.) 

(c)  Show  how  you  can  obtain  this  sum  using  Pascal’s  triangle. 

5.  For  each  n  £  Z+,  consider  the  statement 


(a)  Do  something  clever  for  a  few  n  (as  you  did  in  4  (a)). 

(b)  Prove  the  statement. 

(c)  Show  how  you  can  obtain  this  sum  using  Pascal’s  triangle. 

6.  How  does  Pascal’s  triangle  relate  to  the  Binomial  Theorem  (Theorem  18.10)? 

Now  you  are  ready  for  the  main  task  of  this  project.  Work  it  carefully.  Be  as 
creative,  imaginative,  clever,  and  resourceful  as  possible. 


Open-Ended  Project 

Find  a  pattern  that  appears  in  Pascal’s  triangle,  but  that  does  not  already  appear  in 
the  text.  State  your  formula  carefully,  and  then  prove  the  result.  There  are  many 
different  patterns ! 


Notes  and  Sources 

Pascal’s  original  article,  in  Latin  with  a  French  translation,  appears  in  [81].  A  very 
readable  comprehensive  history  of  Pascal’s  triangle  can  be  found  in  [23]. 
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0  1 

Fig.  29.6  First  stage:  E\ 

0  *  *  *  *  *  *  1 

Fig.  29.7  Second  stage:  £2 

29.9  The  Cantor  Set 
Introduction 

In  this  project,  you’ll  learn  about  the  Cantor  set — a  set  that  is  in  some  ways  very 
small,  and  in  other  ways  very  big. 


Prerequisites 

Proofs  in  this  section  are  by  induction.  You  will  need  the  background  provided  by 
Chapter  18,  and  Chapters  21-23. 


Guided  Project 

1.  (The  Cantor  Set)  To  construct  the  Cantor  set,  let  I  =  [0, 1]. 

(a)  (First  stage.)  We  will  remove  the  middle  third  of  this  set;  that  is,  we  re¬ 
move  the  open  interval  (1/3, 2/3)  from  [0, 1].  So  two  intervals  remain. 
(See  Figure  29.6.)  Let  £j  =  /\ (1/3, 2/3)  =  [0, 1/3]  U  [2/3, 1].  If  you 
were  to  assign  a  length  to  £1,  what  length  would  you  assign? 

(b)  (Second  stage.)  Remove  the  middle  open  third  from  each  of  the  two 
remaining  intervals;  that  is,  let  £2  =  £1  \  ((1  /9, 2/9)  U  (7/9, 8/9)).  So 
£2  is  a  union  of  four  closed  intervals.  (See  Figure  29.7.)  Write  £2  as 
this  union  of  four  closed  intervals.  If  you  were  to  assign  a  length  to  £2, 
what  length  would  you  assign? 

(c)  (Third  stage.)  Remove  the  middle  open  third  from  each  of  the  remain¬ 
ing  four  intervals.  Thus  £3  is  a  union  of  eight  intervals.  (See  Fig¬ 
ure  29.8.)  Write  £3  as  a  union  of  these  eight  closed  intervals.  If  you 
were  to  assign  a  length  to  £3,  what  length  would  you  assign? 

(d)  (nth  stage.)  Now  consider  £„,  obtained  from  £„_  1  by  removing  the 
open  middle  thirds  of  each  of  the  intervals  that  compose  £„_  1.  If  you 
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0 


1 


Fig.  29.8  Third  stage:  £3 


were  to  assign  a  length  to  En,  what  length  would  you  assign?  State  your 
guess  for  the  length  of  En  in  a  complete,  coherent  sentence.  Prove  that 
your  guess  is  correct. 

The  Cantor  set  is  the  set  E  defined  by  E  =  f|“=i  E„. 

2.  If  you  were  to  assign  a  length  to  E,  what  length  would  you  assign?  Why? 

3.  Give  examples  of  numbers  that  you  know  are  in  the  Cantor  set;  that  is,  give 
examples  of  numbers  that  are  in  E„  for  every  n. 

4.  Another  view  of  the  Cantor  set.  There  are  far  more  points  in  the  Cantor  set 
than  you  might  think.  To  see  this,  it  is  best  to  revisit  the  Cantor  set. 

Each  point  x  in  the  interval  [0, 1]  has  something  called  a  ternary  expansion. 
The  first  digit  in  the  ternary  expansion  for  x,  denoted  xi,  is  found  as  follows: 
We  divide  the  interval  [0, 1]  into  thirds.  If  x  lies  in  the  first  third,  [0, 1  /3],  we 
assign  xi  the  value  0.  If  x  lies  in  the  middle  third,  [1/3, 2/3],  we  assign  x\ 
the  value  1,  and  if  it  lies  in  the  last  third,  [2/3, 1],  we  assign  xi  the  value  2. 
(We  note  that  there  is  some  ambiguity  about  what  happens  at  the  endpoints. 
When  working  with  the  Cantor  set  (as  we  discuss  below),  whenever  we  have 
a  choice,  we  will  choose  either  0  or  2  and  not  the  number  1 .) 

We  now  proceed  to  the  second  digit,  X2,  in  the  ternary  expansion,  which  we 
find  as  follows:  Ifxi  =  0,  then  x  lies  in  the  interval  [0, 1/3].  Divide  this  interval 
into  thirds.  If  x  lies  in  the  first  third,  [0, 1  /9],  we  assign  x?  the  value  0.  If  x  lies 
in  the  middle  third,  [1  /9, 2/9],  we  assign  X2  the  value  1.  And  if  x  lies  in  the 
final  third,  [2/9, 3/9],  we  assignx2  the  value  2.  Similarly,  ifxi  =  1,  thenx  lies 
in  the  interval  [1  /3, 2/3],  and  we  assignx2  a  value  of  0  if  x  lies  in  the  interval 
[3/9, 4/9],  a  value  of  1  if  x  lies  in  the  interval  [4/9, 5/9],  and  a  value  of  2  if  x 
lies  in  the  interval  [5/9, 6/9].  Finally,  ifxj  =  2,  thenx  lies  in  [2/3, 1],  and  we 
divide  this  interval  into  thirds,  assigning  X2  the  value  of  0, 1,  or  2. 

For  X3,  we  use  xi  and  X2  to  tell  us  which  interval  to  look  at.  We  then  divide 
that  interval  into  thirds,  and  we  assign  a  value  of  0,1,  or  2  to  X3.  It  should 
be  clear  that  all  endpoints  will  have  two  possible  representations,  while  all 
other  points  will  have  exactly  one  representation.  Comparing  the  procedure 
defined  in  part  1  of  this  project,  with  the  procedure  we  have  outlined  to  find 
the  ternary  expansion  of  x,  we  see  that  the  Cantor  set  consists  of  all  points 
for  which  there  exists  a  ternary  expansion  consisting  of  0’s  and  2’s.  (That’s 
why  we  never  chose  the  number  1 .)  Without  going  into  too  many  details,  the 
ternary  expansion  really  means  that 


x=  E  31’  where  xk  =  0,1,  or  2. 
k=\  i 
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(a)  There  are  two  ternary  representations  for  the  number  1/3.  What  are 
they? 

(b)  There  are  two  ternary  representations  for  the  number  2/3.  What  are 
they? 

(c)  Find  the  first  six  terms  of  the  sequence  associated  with  1  /4. 

(d)  Find  the  first  six  terms  of  the  sequence  associated  with  1/8. 

5.  If  you  have  studied  series,  then  you  recall  that  for  —  1  <  r  <  1,  we  have  the 

following  formula  for  the  sum  of  the  geometric  series:  £~=1  =  r/(  1  —  r). 

Using  the  first  six  terms  of  the  ternary  expansion  for  1/4  that  you  deter¬ 
mined  above,  guess  all  the  other  digits  in  the  expansion.  Then  sum  the  series 
Lr=i  xk/3k  to  show  that  you  have  found  the  representation  for  1  /4.  Does  1  /4 
lie  in  the  Cantor  set?  What  about  1/8? 

6.  We  have  presented  the  outline  of  a  proof  that  there  is  a  one-to-one  correspon¬ 
dence  between  points  in  the  Cantor  set  and  sequences  of  0’s  and  2’s.  Fill  in 
the  details,  and  use  this  to  prove  the  theorem  below. 

Theorem  29.8.  The  Cantor  set  is  uncountable. 

So  the  Cantor  set  has  “length”  zero,  but  is  an  uncountable  set.  You  can  learn 
more  about  the  Cantor  set  (much  more)  and  the  idea  of  length  in  the  reference  given 
below. 


Open-Ended  Project 

What  happens  if  instead  of  removing  the  middle  third  of  the  set,  you  remove  the 
middle  fifth?  Think  about  other  sets  you  can  create  in  this  way,  and  say  as  much  as 
you  are  able  to  about  them. 


Notes  and  Sources 

This  topic  is  discussed  in  many  textbooks.  In  particular,  a  summary  of  many  of  the 
interesting  properties  of  this  set  can  be  found  in  [8,  pp.  352-354].  For  a  short  article 
(with  a  card  trick)  relating  the  Cantor  set  to  fractals,  see  [12,  pp.  1 14-121]. 


29.10  The  CBS  Inequality 
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29.10  The  Cauchy-Bunyakovsky-Schwarz  Inequality 
Introduction 

In  this  project  you’ll  prove  two  inequalities.  The  second  of  the  two  is  the  triangle 
inequality  in  R",  and  the  first  is  used  to  prove  the  second. 


Prerequisites 

What  you  need  for  this  project  depends  upon  how  you  prove  it.  You  may  need  little 
to  no  background,  other  than  an  understanding  of  what  R"  is  and  how  you  add  and 
subtract  in  that  space. 


Guided  Project 

Consider  two  points,  x  =  (xi,X2,  ■  ■  ■  ,x„)  and  y  =  (yi,y2,  ■  ■  ■  ,yn ),  in  R".  Recall  that 
x  +  y=  (xj  +  yi,x2+y2,-..,x„+y„)  and  x-y  =  xiyi  +  x2y2  H - bx„y„.  We’ll  in¬ 

troduce  some  notation  that  will  make  things  neater.  We’ll  write  x-y  =  Jj'j=1x;y;-  and 

||x||  =  x 2  +  x2  H - bx2.  For  A  £  R,  we  write  Ax  =  (Axi,Ax2,  . . . ,  Ax„). 

1.  Get  used  to  this  notation:  Let  x  =  (0, 1,2)  and  y  =  (—1,2,3)  in  R3.  What  is 
x-y?  What  is  ||x||?  ||y||?  x  —  y?  x  +  y?  Make  up  some  examples  in  R2  and  R4. 

2.  Keep  getting  used  to  this  notation:  What  is  the  set  {x  G  R2  :  ||x||  =  1}?  What 
is  {x  G  R3  :  ||x||  =  1}?  If  you  fix  x  G  R3,  what  is  {y  G  R3  :  ||x  —  y||  <  2}? 

3.  Letx  G  R"  and  A  G  R.  Prove  that  ||Ax||  =  |A|||x||. 

4.  Letx,y  G  R".  Prove  that  (x  +  y)  •  (x  +  y)  =  ||x  +  y||2. 

5.  Letx,y  G  R".  Prove  that  (x  — y)  •  (x  — y)  =  ||x|| 2  — 2(x-y)  +  ||y||2. 

6.  Let  x,y  G  R".  Find  a  similar  formula  for  (x  +  y)  •  (x  +  y). 

7.  Suppose  x  and  y  are  two  points  in  R”  such  that  ||x||  =  1  and  ||y||  =  1.  Prove 

that  |x - y |  <  1.  (Problem  5  above  together  with  the  fact  that  z-z  >  0  for  all 

z  G  R",  should  help  to  point  you  in  the  right  direction.) 

8.  Let  xGR".  Prove  that  if  x  ^  (0,0, ...  ,0),  then  j|x/||x||  ||  =  1. 

9.  Let  x  and  y  be  two  points  in  R".  Prove  that  |x-y|  <  ||x||  ||y||.  (Problems  7  and 
8  should  be  helpful  here.)  In  many  textbooks,  this  inequality  is  referred  to 
as  the  Cauchy-Schwarz  inequality;  others  call  it  the  Cauchy-Bunyakovsky- 
Schwarz  inequality. 

10.  Use  Problems  4  and  9  to  prove  that  for  x  and  y  in  R",  the  triangle  inequality 
holds;  that  is,  ||x  +  y||  <  ||x||  +  ||y||. 

11.  The  Cauchy-Bunyakovsky-Schwarz  inequality  can  be  used  to  prove  inter¬ 
esting  inequalities  about  real  numbers.  Use  it  to  prove  the  following:  Let 
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a\  ,02,  ■  ■  ■  ,a„  be  real  numbers.  Then 

Y^a)>  C^ajf/n. 

7=1  7=1 

12.  Bunyakovsky,  Cauchy,  and  Schwarz  all  have  their  names  attached  to  this  the¬ 
orem.  Who  proved  what,  and  when  did  they  prove  it? 


Open-Ended  Project 

For  two  points  x  and  y  in  R",  the  line  segment  joining  x  and  y  is  defined  by  the 
set  {z  £  R"  :  z  =  Ax  +  ( 1  —  A )y,  where  A  £  R  and  0  <  A  <  1}.  For  example,  in  R2 
choose  two  points,  say  (1,2)  and  (2,4).  Then  the  line  segment  joining  these  two 
points  is  the  set 

{(A  +2(1  -  A),2A  +  4(1  -  A)),0  <  A  <  1}  =  {(2-A,4-2A)  :  0  <  A  <  1}, 

which  is  indeed  the  line  segment  joining  the  two  points  (1,2)  and  (2,4).  Try  this  out 
on  other  points,  and  in  R3,  and  then  move  on  to  the  next  definition: 

A  nonempty  set  S  C  R"  is  said  to  be  convex  if  whenever  x,y  £  S,  then  the  line 
segment  joining  x  and  y  is  in  S.  Investigate  this  definition,  considering  the  following 
in  your  investigation.  (You’ll  find  the  triangle  inequality,  as  well  as  many  of  the 
exercises  above,  quite  handy.) 

1.  Show  that  {x  £  R"  :  ||x||  <  1}  is  convex. 

2.  Give  other  examples  of  convex  sets. 

3.  Is  the  union  of  two  convex  sets  convex? 

4.  Is  the  intersection  of  two  convex  sets  convex? 

5.  Now  return  to  part  2  and  see  if  you  can  come  up  with  other  interesting  examples. 

6.  What  are  some  other  interesting  questions  (and  answers)  about  convex  sets? 


Notes  and  Sources 

For  more  information  on  the  Cauchy-Bunyakovsky-Schwarz  inequality,  see  the  ar¬ 
ticle  by  P.  Schreiber  [96].  There  are  also  other  (more  clever,  less  intuitive)  ways  of 
proving  this  inequality. 


29. 1 1  Algebraic  Numbers 
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29.11  Algebraic  Numbers 
Introduction 

A  real  number  is  algebraic  if  it  is  the  root  of  a  polynomial 

p(x)  =  anxn-\ - haix  +  aoj 

where  n  is  a  positive  integer,  ao,ai,...,a„  £  Z,  and  cin  f  0.  A  real  number  is  tran¬ 
scendental  if  it  is  not  algebraic. 

It’s  easy  to  think  of  examples  of  algebraic  numbers:  0  is  algebraic,  because  it 
is  a  (the)  root  of  the  polynomial  p{x)  =  x;  the  number  1  / 2  is  algebraic,  because  it 
is  a  root  of  the  polynomial  q{x)  =  2x2  —x.  It’s  much  more  difficult  to  think  of  a 
number  that  is  not  algebraic.  Why?  Well,  suppose  you  have  a  guess  that  a  certain 
real  number  a  is  transcendental.  Then  to  prove  your  guess,  you  must  show  that  for 
every  polynomial  p  with  integer  coefficients,  p(a)  /  0.  Before  reading  on,  try  to 
guess  whether  there  are  more  transcendental  numbers  or  more  algebraic  numbers. 
In  an  1874  paper  Georg  Cantor  proved: 

Theorem  29.9  (Cantor).  There  are  countably  many  algebraic  numbers. 

In  this  project,  you  will  prove  this  theorem. 


Prerequisites 

This  project  requires  material  up  to  and  including  Chapter  23.  We  mention  one  ad¬ 
ditional  theorem  that  you  will  need  and  that  we  have  not  covered  yet,  namely  Theo¬ 
rem  29.15  stated  below.  If  you  work  Project  29.12,  you  will  also  have  a  proof  of  this 
theorem.  For  this  project,  however,  you  may  assume  the  validity  of  Theorem  29.15 
and  apply  it  to  complete  the  work  required  here. 

Theorem  29.15.  If  for  each  j  £  Z+  the  set  Aj  is  countable,  then  U/eZ+^;'  count¬ 
able. 


Guided  Project 

1 .  Familiarize  yourself  with  the  definition  of  an  algebraic  number  by  answering 
the  next  few  questions.  As  you  do  so,  you  should  also  get  an  idea  of  what 
transcendental  numbers  are  like,  and  you  will  begin  to  suspect  some  of  your 
old  numerical  friends  of  being  transcendentals. 

Come  up  with  examples  of  algebraic  real  numbers  that  have  not  been  pre¬ 
sented  in  this  project.  Are  some  rational  numbers  algebraic?  are  all  ratio¬ 
nal  numbers  algebraic?  What  about  the  irrational  numbers?  How  would  you 
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prove  that  a  particular  number  is  algebraic  or  transcendental?  Try  to  guess 
which  of  the  following  are  transcendental  numbers:  s/2, 5/7,7 r,  and  e.  (If  you 
think  one  of  these  numbers  is  algebraic,  prove  it.  It’s  beyond  our  capabilities, 
at  this  time,  to  prove  that  the  other  numbers  are  transcendental.) 

Now  you  should  be  ready  for  the  proof  of  Cantor’s  theorem.  The  proof  is  outlined 
below. 

2.  How  might  you  attack  the  proof  of  Cantor’s  theorem?  Does  it  remind  you  of 
anything  we  have  done  before?  What? 

3.  Solve  Problem  5.22,  if  you  haven’t  already  done  so. 

4.  Solve  Problem  23.10,  if  you  haven’t  already  done  so. 

5.  Recall  that  for  sets  A\ ,A2,  ■  ■  ■  ,An,  the  Cartesian  product  of  these  n  sets  is 

A\  x  A2  x  •  •  •  x  An  =  {(xi,X2,. . . ,x„)  :xj  sAj}. 

Prove  the  following  generalization  of  Corollary  23.10. 

Theorem  29.10.  Let  n  £  Z+.  If  A;  is  countable  for  all  i=  1,2 then 
A 1  x  A2  x  •  •  •  x  An  is  countable. 

6.  Let  n  €  Z+.  Show  that  the  set  of  polynomials  of  degree  n  with  integer  coeffi¬ 
cients  is  countable. 

7.  Prove  that  the  set  of  all  polynomials  with  integer  coefficients  is  countable. 

8.  Prove  that  the  set  of  algebraic  numbers  is  countable. 

9.  Show  that  there  exist  transcendental  numbers.  (Suggestion:  Don’t  try  to  actu¬ 
ally  find  such  a  number;  just  try  to  show  that  they  exist.) 

10.  Are  the  transcendental  numbers  countable  or  uncountable?  Prove  your  an¬ 
swer. 

You  have  now  seen  how  the  partition  of  the  reals  into  the  algebraic  numbers  and 
the  transcendental  numbers  works.  It  is  time  to  try  something  on  your  own. 


Open-Ended  Project 

Define  a  property  &  of  real  numbers  that  seems  to  be  of  value  to  you.  Let  A  denote 
the  set  of  all  reals  that  have  property  SP,  and  let  It  denote  the  set  of  all  reals  that 
do  not  have  property  S? .  Then  decide  for  each  of  the  two  sets,  A  and  It,  whether 
the  set  is  countable  or  not.  Prove  all  your  statements.  The  more  creative  you  are  in 
defining  £?,  the  harder  it  will  be  to  prove  the  countability  or  uncountability  of  your 
sets.  Here’s  a  chance  to  really  show  all  your  mathematical  prowess! 


29. 12  The  Axiom  of  Choice 
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Notes  and  Sources 

There  is  a  difference  between  proving  the  existence  of  transcendental  numbers,  and 
showing  that  a  particular  number  is  transcendental.  As  we  mention  in  the  Spotlight: 
Hilbert’s  Seventh  Problem,  proofs  that  the  numbers  71  and  e  are  transcendental  were 
given  around  the  time  of  Cantor’s  proof.  The  original  paper  by  G.  Cantor  [16]  is 
written  in  German.  A  brief  summary  of  Cantor’s  proof  can  be  found  in  M.  Kline’s 
book  [59,  pp.  996-997], 


29.12  The  Axiom  of  Choice 
Introduction 

Return,  for  the  moment,  to  Hilbert’s  Hotel  Infinity  discussed  in  Chapter  21.  Suppose 
that  in  each  room  there  is  exactly  one  pair  of  boots.  Can  you  come  up  with  a  rule  that 
chooses  one  boot  from  each  room?  To  be  mathematically  precise,  we’ll  introduce 
the  following  notation.  The  set  Bj  will  contain  the  two  boots,  and  only  the  two  boots, 
in  room  j.  Then  88  =  { Bj  :  j  G  Z  1  }  is  the  collection  of  all  these  sets  of  boots.  Can 
we  find  a  function  f  \  8fi  —>  Ujez+Bj  such  that  f(Bj)  G  Bj!  The  answer,  as  you 
probably  guessed,  is  “sure,  we  can  do  that.”  For  instance,  we  can  define  f(Bj)  to  be 
the  left  boot  in  Bj.  The  sentence  “y  is  the  left  boot  in  the  set  Bj”  is  a  statement.  Thus 
the  substitution  axiom  on  page  365  allows  us  to  obtain  the  desired  function. 

Suppose  now  that  the  hotel  guests  have  gone  out  and  taken  their  boots,  but  they 
have  forgotten  their  socks.  Assume  that  their  socks  are  identical;  that  is,  you  can¬ 
not  tell  the  right  sock  from  the  left  sock.  (This  is  true  of  most,  but  not  all,  socks!) 
Can  we  come  up  with  a  rule  that  chooses  one  sock  from  each  room?  We’ll  let  Sj 
denote  the  set  containing  the  two  socks,  and  only  the  two  socks,  of  room  j.  Then 
S?  =  { Sj :  j  G  Z+}  is  the  collection  of  all  these  sets  of  socks.  The  question  is  then, 
can  we  find  a  function/:  S?  — >  U/eZ+  Sj  such  that  f(Sj)  G  Sj!  This  time  it’s  difficult 
to  come  up  with  a  solution  to  this  problem;  in  fact,  we  don’t  have  a  rule  available 
along  the  lines  of  the  one  we  had  for  shoes.  Nevertheless,  intuitively  speaking,  it 
seems  as  though  it  should  be  possible  to  pick  one  sock  from  each  pair — at  least, 
most  mathematicians  think  this  is  intuitive.  We’ll  now  introduce  an  axiom  that  will 
allow  us  to  do  exactly  this. 

Axiom  29.11  (Axiom  of  Choice).  Given  a  nonempty  collection  of  nonempty 
sets,  there  is  a  function  / :  &  — >  \JAe^A  such  that  /(A)  G  A. 

There  are  many  statements  that  turn  out  to  be  equivalent  to  the  axiom  of  choice. 
Some  of  these  statements  are  major  theorems,  some  are  part  of  set  theory,  and  some 
are  theorems  in  other  fields  of  mathematics.  We’ll  just  mention  two  of  the  most 
important  ones  in  set  theory.  In  order  to  understand  the  statements  you  will  need  to 
review  Definitions  13.1  and  13.2. 
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We’ll  introduce  a  few  more  terms  before  we  begin  this  project.  You’ll  need  a 
firm  grasp  on  these  definitions,  so  work  through  some  examples  and  nonexamples 
of  each.  We  start  by  generalizing  Definition  19.3  to  arbitrary  partially  ordered  sets. 
Namely,  if  X  is  a  set  with  a  partial  order  A  and  A  is  a  nonempty  subset  of  X,  then  we 
call  a  £  X  an  upper  bound  of  A  if  x  A  a  for  all  x  £  A.  A  chain  in  a  partially  ordered 
set  X  is  a  subset  C  that  is  totally  ordered  when  the  order  on  X  is  restricted  to  C. 
A  least  element  in  a  partially  ordered  set  X  is  an  element  m  £  X  such  that  m  A  x 
for  all  x  £  X.  A  maximal  element,  M  £  X,  is  an  element  with  the  property  that  for 
all  x  £  X,  if  M  A  x,  then  M  =  x.  Finally,  a  partially  ordered  set  Y  is  well-ordered  if 
every  nonempty  subset  A  of  Y  has  a  least  element  in  A. 

Theorem  29.12  (Well-ordering  theorem).  Every  set  can  be  well-ordered. 

This  theorem  surely  reminds  you  of  the  well-ordering  principle  of  N  in  Chapter  12. 
In  some  ways  it  is  a  generalization,  as  the  theorem  above  applies  to  any  set — not  just 
to  the  natural  numbers.  But  be  careful:  the  well-ordering  theorem  does  not  tell  you 
about  a  particular  order  on  a  set;  it  just  claims  that  there  is  an  order  under  which  the 
set  is  well-ordered.  For  instance,  ffi.  with  the  usual  order  (less  than  or  equal  to)  is  not 
well-ordered.  (Why  not?)  The  theorem  claims  that  there  is  an  order  under  which  R 
is  well-ordered  but,  unfortunately,  it  does  not  give  us  any  hints  on  how  to  construct 
such  an  order.  Looking  at  it  from  this  perspective,  we  see  that  the  theorem  stated 
above  is  not  a  generalization  of  the  well-ordering  principle  of  N. 

It  turns  out  that  the  following  form  of  the  axiom  of  choice  is  particularly  useful. 
It  is  due  to  Max  Zorn,  a  much-loved  mathematician  as  can  be  seen  from  testimonies 
of  his  family  [114]  and  colleagues  [40]. 

Lemma  29.13  (Zorn’s  lemma).  Let  X  be  a  partially  ordered  set  such  that  every 
chain  in  X  has  an  upper  bound  in  X.  Then  X  contains  a  maximal  element. 

For  a  proof  of  the  equivalence  of  Zorn’s  lemma,  the  axiom  of  choice,  and  the  well¬ 
ordering  theorem  see  [41].  A  short  proof  that  the  axiom  of  choice  implies  Zorn’s 
lemma  is  in  [63].  Despite  the  fact  that  all  three  statements  are  equivalent,  people 
have  a  better  intuition  for  some  of  them  than  for  others.  The  American  mathemati¬ 
cian  Jerry  Bona  expressed  this  aptly:  “The  Axiom  of  Choice  is  obviously  true,  the 
well-ordering  principle  obviously  false,  and  who  can  tell  about  Zorn’s  lemma?” 

One  of  our  goals  in  this  project  is  to  help  you  understand  the  axiom  of  choice;  it 
will  appear  frequently  in  your  future  mathematics  courses.  But  our  main  goal  in  the 
guided  portion  of  this  project  is  to  show  that  we  can  find  a  total  order  that  allows  us 
to  compare  the  cardinalities  of  every  pair  of  sets  from  a  collection  of  sets,  sY . 


Prerequisites 

This  project  requires  that  you  have  a  basic  understanding  of  sets,  relations,  order, 
and  functions  as  explained  in  Chapters  5  through  17.  You  also  need  to  understand 
the  theory  of  cardinality  as  developed  in  Chapters  22  through  24. 
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Guided  Project 

In  the  introduction  to  this  project  we  presented  you  lots  of  definitions,  some  old  and 
some  new.  We  need  to  make  sure  that  you  fully  understand  them  before  proceeding 
to  the  project. 

1.  Solve  (or  re-solve)  Problems  13.13-13.15. 

2.  Consider  ^(N)  with  the  partial  order  C.  Give  an  example  of  an  interesting 
nonempty  finite  subset  of  ^  ( N )  that  is  a  chain  and  an  interesting  infinite 
subset  of  AA’(N)  that  is  a  chain.  Then  give  an  example  of  a  nonempty  finite 
subset  of  ^(N)  that  is  not  a  chain  and  an  infinite  subset  of  ^(N)  that  is 
not  a  chain — both  sets  should  be  interesting!  (We  define  “interesting”  in  this 
problem  to  mean  that  the  sets  are  nontrivial,  and  that  the  answers  to  part  3 
below  are  as  varied  as  possible.) 

3.  Write  down  the  definitions  of  greatest  element  and  minimal  element.  Explain 
the  difference  between  greatest  element  and  maximal  element  as  well  as  the 
difference  between  least  element  and  minimal  element.  For  each  of  your  four 
sets  in  part  2,  decide  whether  it  has  a  minimal,  maximal,  greatest,  and  least 
element. 

Let  srf  be  a  nonempty  collection  of  sets  and  denote  by  E  (,k/)  the  set  of  all  equiv¬ 
alence  classes  of  the  relation  «  on  srf  as  defined  in  Definition  21.1  (and  shown  to 
be  an  equivalence  relation  in  Theorem  21.1).  Thus  if  Ea  £  E  {,<//),  then  two  sets  A\ 
and  A 2  in  .&/  are  in  E,\  if  and  only  if  there  is  a  bijection  between  the  two  sets  A  i  and 

M- 

For  Ea,Er  £  E (&/),  we  will  let  A'  denote  a  set  in  Ea  and  B'  denote  a  set  in  Eg- 
We  define  the  relation  A  on  E (/a/)  by  E,\  A  Eb  if  \A'\  <  \B'\  (that  is,  there  is  an 
injective  function  /  :  A'  — ►  B'). 

We  aim  to  establish  the  theorem  below,  using  the  terminology  introduced  above. 
This  theorem  tells  us  that  we  can  compare  the  cardinality  of  any  two  sets  and  that 
this  comparison  obeys  the  usual  rules  of  a  total  order. 

Theorem  29.14.  For  a  nonempty  collection  ■<//  of  sets,  the  relation  A  is  a  total  order 
on  E(£/). 

4.  If  you  have  not  yet  done  so,  work  Problem  24.4. 

5.  Prove  that  the  relation  A  on  E(g/),  as  defined  above,  is  well-defined.  That  is, 
show  that  given  EA .  En  £  E  (.<//)  we  can  always  find  A'  £  Ea  and  B'  £  Eg  and 
that  the  definition  of  A  does  not  depend  on  the  particular  choice  of  A'  and  B' . 

6.  Using  the  result  of  part  4,  show  that  A  is  a  partial  order  on  E(&/). 

7.  For  two  sets  A  and  B  we  say  that  /  is  a  partial  function  from  A  to  B  it 
/  :  A'  — ►  B  for  some  A'  C  A.  Let  &  be  the  set  of  all  injective  partial  func¬ 
tions  from  A  to  B.  We  define  a  relation  I  on  &  as  follows:  For  f\  :  A  \  — >  B 
and  f2  :  A2  — >  B,  partial  functions  in  & , 

f\  h  /2  if  4|  =  dom(/i)  C  dom(/2)  =A2  and 
f\  is  the  restriction  of  f2  to  A  \ . 


356 


29  Projects 


Prove  that  h  is  a  partial  order  on 

8.  Let  A  and  B  be  two  sets  such  that  |A|  |ZJ|.  Prove  that  |A|  <  \B\  using  the 

construction  from  part  7  of  this  project  and  Zorn’s  lemma. 

9.  Prove  Theorem  29. 14. 

Now  we  turn  our  attention  to  the  following  theorem,  which  is  a  generalization  of 
Corollary  23.7. 

Theorem  29.15.  If  for  each  j  £  IN  the  set  A  j  is  countable,  then  U/gz+^f  count¬ 
able. 

10.  Prove  Theorem  29.15.  Note  that  induction  will  not  work  here.  We  suggest 
that  you  adapt  the  ideas  of  the  alternate  proof  of  Theorem  23.11  outlined  in 
Problem  23.12. 

11.  Say,  explicitly,  where  you  used  the  axiom  of  choice  in  the  above  proof. 


Open-Ended  Project 

This  part  is  an  exercise  in  the  history  of  mathematics.  We  will  ask  you  to  find 
resources,  summarize  your  findings,  and  present  them  in  a  coherent  and  interest¬ 
ing  way:  Research  the  history  of  the  axiom  of  choice.  Find  variations  and  give  an 
overview  of  equivalent  forms  of  the  axiom  of  choice.  What  does  mathematical  con¬ 
structivism  think  of  the  axiom  of  choice?  (To  answer  the  last  question  you  will  need 
to  find  out  what  mathematical  constructivism  is!) 


Notes  and  Sources 

There  are  many  books  that  explore  the  history  of  the  axiom  of  choice,  its  implica¬ 
tions,  and  its  equivalent  forms.  Two  of  them  are  [48]  and  [91].  The  first  part  of  the 
guided  project  is  based  on  [28].  Finally,  we  note  that  there  are  many  jokes  based 
on  Zorn’s  lemma  and  the  axiom  of  choice.  Every  budding  mathematician  should 
know  the  following,  “What  is  yellow,  sour,  and  equivalent  to  the  axiom  of  choice?” 
Answer:  “Zorn’s  lemon.” 

Mathematicians  are  funny. 
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29.13  The  RSA  Code 
Introduction 

Though  coding  theory  has  always  been  important,  a  giant  leap  forward  occurred  in 
the  second  half  of  the  twentieth  century,  with  the  invention  of  public  key  cryptogra¬ 
phy.  The  main  idea  (due  to  W.  Diffie  and  M.  Heilman)  is  the  concept  of  a  trapdoor 
function — a  function  that  has  an  inverse,  but  the  inverse  is  very  difficult  to  find.  In 
fact,  it  should  require  so  long  for  someone  who  did  not  invent  the  original  func¬ 
tion  to  find  the  inverse  that,  for  all  practical  purposes,  the  inverse  does  not  exist. 
In  1976,  R.  L.  Rivest,  A.  Shamir,  and  L.  M.  Adleman  succeeded  in  finding  such  a 
class  of  functions,  and  their  idea  is  based  upon  one  of  the  most  elementary  ideas  in 
mathematics — multiplication  of  two  numbers.  (See  the  Spotlight:  Public  and  Secret 
Research  in  Chapter  28.) 

It  turns  out  that  if  you  take  two  very  large  numbers  and  multiply  them  together,  a 
machine  can  quickly  compute  the  answer.  But,  if  you  give  the  machine  the  answer 
and  ask  for  two  factors,  the  factorization  will  not  appear  in  a  useful  amount  of  time. 
The  public  key  system,  built  upon  these  ideas,  is  now  known  as  RSA-key  (after 
the  three  men  who  created  the  system).  It  was  described  in  Chapter  28,  but  in  this 
project  you  will  learn  the  details. 


Prerequisites 

We  assume  that  you  worked  Chapters  27  and  28  on  modular  arithmetic  and  Euler’s 
theorem.  In  particular,  we  will  refer  to  the  description  of  the  public  code  that  was 
given  at  the  end  of  Chapter  28.  The  notation  was  introduced  in  the  chapter.  You  will 
also  need  a  good  calculator;  one  that  is  able  to  determine  whether  a  number  is  prime, 
can  factor  an  integer,  and  can  do  modular  arithmetic.  For  some  parts  of  this  project, 
you  will  need  to  use  Mathematica.  (If  you  don’t  have  access  to  Mathematica,  you 
can  skip  the  parts  that  require  it.) 


Guided  Project 

1 .  Reread  the  paragraphs  of  Chapter  28  following  the  proof  of  Euler’s  theorem. 

2.  Let’s  start  with  a  small  example  to  make  sure  we  understand  the  basics  of  the 
code:  We  choose  p  =  13  and  q=ll  (so  that  n  =  pq  =  13  •  17  =  221).  For  the 
encoding  exponent  we  choose  e  =  1 1 .  Verify  that  e  =  1 1  is  a  feasible  choice 
for  the  encoding  exponent.  Use  the  public  key  (n,e)  to  calculate  the  “secret” 
value  of  d.  Encode  the  following  three  plaintexts: 


(a)  m  =  157; 
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(b)  ;7i  =  97216; 

(c)  777  =  91. 
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Decode  them  again  to  convince  yourself  that  the  method  works. 

3.  Note  that  gcd(91, 13  •  17)  =  13  ^  1,  so  the  hypothesis  of  Example  28.8  is  not 
satisfied.  It  turns  out  that  the  method  still  works,  even  in  this  case.  Let’s  try  to 
see  why  it  still  works — what  could  go  wrong?  In  this  code,  we  always  assume 
that  77  is  the  product  of  two  primes:  n  =  pq,  where  p  and  q  are  primes.  Thus, 
the  gcd(777,  77)  is  p.q.  or  1.  If  gcd(7?r, w)  =  1,  then  Example  28.8  applies.  Prove 
that  even  if  gcd(77i,7z)  =  p  (or  q),  the  decoding  with  exponent  d  still  works: 

Lemma  29.16.  Let  m,n  €  Z  with  0  <  m  <  n,  gcd(7?7,7z)  =  p,  and  n  =  pq  for 
primes  p  and  q  with  p  f  q.  Further,  let  e,d  €  Z  with  ed  =  1  (mod  </>(tz)). 
Then  med  =  m  (mod  n). 

You  will  need  Theorem  28.10  (appearing  in  the  problem  section)  and  Theo¬ 
rem  28.7  for  this  proof. 

4.  In  practice,  n  must  be  chosen  to  be  quite  large — certainly  larger  than  10.  Nev¬ 
ertheless,  it  may  still  be  the  case  that  the  plaintext  m  may  satisfy  m  >  n.  Recall 
that  if  777  >  77,  then  we  have  to  break  the  integer  m  into  parts.  Here’s  how  to 
do  this:  Choose  positive  integers  mi ,  m2, . . . ,  np  such  that  777,  <  n  for  1  <  i  <  k 
and  777  =  777 1 1 7720 1 . . .  \mic,  where  the  last  expression  denotes  simple  lining  up 
of  the  integers  in  the  decimal  notation  for  m.  (For  example,  if  m  =  15208 
and  77  =  77,  we  can  take  mi  =  15,  m2  =  20,  and  7773  =  8.  Then  m  =  15|20|8.) 
We  will  then  denote  the  (chopped  up)  plaintext  as  (7771,7772, . . . , 777^)  and  the 
ciphertext  as  (777;  (mod  n),me2  (mod  n),...  ,mek  (mod  77)). 

Now  you  are  ready  for  the  problem:  Suppose  you  are  given  the  public  key, 
77  =  2881  and  e  =  47.  The  intercepted  message  contains  the  criminal’s  hair 
color.  However,  the  message  is  encoded  according  to  the  rules  we  described 
in  the  previous  paragraph.  The  ciphertext  reads  (2574, 1 120, 166,742)  (all  in¬ 
tegers  (mod  2881)).  The  translation  from  letters  to  integers  is  done  by  con¬ 
verting  a  —7  01, b  —*■  02, . . .  ,z  —  26.  Crack  the  code  to  find  out  the  criminal’s 
hair  color. 

5.  If  you  cracked  the  message  in  the  previous  part,  then  it  is  obvious  that  this 
encryption  is  not  safe.  That’s  because  the  function  we  used  in  that  part  of  the 
problem  is  not  really  a  trapdoor  function.  However,  it  will  become  one  if  we 
choose  our  primes  large  enough.  The  bigger  the  primes,  the  harder  it  is  to 
factor  77  (a  task  believed  to  be  necessary  to  break  the  code).  To  get  a  feeling 
for  the  unequal  amount  of  time  it  takes  to  find  primes  and  multiply  versus 
factoring,  do  the  following  on  your  calculator. 

(a)  By  trial  and  error  using  the  calculator’s  prime  check,  find  two  primes 
of  ten  digits  each.  (Primality  testing  is  also  an  interesting  and  important 
subject.  Your  calculator  uses  sophisticated  algorithms  to  check  whether 
an  integer  is  prime.) 

(b)  Multiply  the  two  integers  together.  (Notice  how  quickly  your  calculator 
can  do  that!) 
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(c)  Now  use  the  factor  command  to  factor  the  number  you  obtained  into 
its  two  primes.  How  long  did  it  take? 

6.  To  do  safe  encoding  with  the  RSA  method  you  need  huge  primes.  Currently 
the  recommendation  is  to  use  primes  of  300  decimal  digits  each.  If  you  have 
access  to  Mathematica,  download  the  notebook  RSA-Notebook.nb  from  the 
site  given  below.  Explore  this  package  and  use  it  to  communicate  with  a  class¬ 
mate,  creating  public  keys  and  sending  messages  to  each  other. 
http://library.wolfram.com/infocenter/MathSource/1966/ 


Open-Ended  Project 

Either  create  a  code  of  your  own,  or  find  a  code  from  another  book.  Try  your  code 
out  on  a  partner,  compare  it  to  RSA,  and  discuss  the  strengths  and  weaknesses  of 
your  code. 


Notes  and  Sources 

The  original  paper  by  R.L.  Rivest,  A.  Shamir,  and  L.M.  Adleman  appears  in  [88].  A 
more  detailed  treatment  can  be  found  in  the  general  number  theory  text  book  by  K. 
H.  Rosen  [89,  Chapter  8].  See  [90]  for  the  commercial  site  of  RSA  Security  Inc.,  a 
company  founded  by  Rivest,  Shamir,  and  Adleman. 

To  learn  about  primality  testing,  you  can  start  with  the  Mathematics  Magazine 
article  [70]  that  gives  a  historical  treatment  of  the  subject  up  to  the  use  of  comput¬ 
ers.  A  comprehensive  treatment  at  the  undergraduate  level  is  contained  in  the  text 
by  Bressoud  [14],  Also,  a  recent  breakthrough  is  presented  in  the  more  advanced 
paper  [1], 


Spotlight:  Hilbert’s  Seventh  Problem 

In  1900,  David  Hilbert  presented  a  speech  in  Paris  entitled  “Mathematische  Prob- 
leme”  to  the  International  Congress  of  Mathematicians.  His  aim  was  to  look  at  the 
future  of  mathematics.  His  speech  began  with  a  description  of  what  makes  a  prob¬ 
lem  significant.  This  introduction  is  followed  by  the  statement  and  discussion  of  23 
problems.  His  speech  appeared  in  1900  in  the  Nachrichten  of  the  Gottingen  Scien¬ 
tific  Society  (more  precisely,  in  Nachrichten  von  der  Koniglichen  Gesellschaft  der 
Wissenschaften  zu  Gottingen).  It  was  translated  into  English,  and  published  in  1902 
in  the  Bulletin  of  the  American  Mathematical  Society. 
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Information  about  the  time  leading  up  to  and  following  the  presentation  can  be 
found  in  C.  Reid’s  biography  Hilbert  [86].  We  present  the  English  translation  of  the 
seventh  problem  below.  Even  today,  it’s  exciting  to  hold  a  copy  of  this  speech  in 
your  hands. 

(You  can  find  a  definition  of  algebraic  and  transcendental  numbers  in  Project 
29.11.) 

Hermite’s  arithmetical  theorems  on  the  exponential  function  and  their  extension  by  Lin- 
demann  are  certain  of  the  admiration  of  all  generations  of  mathematicians.  Thus  the  task 
at  once  presents  itself  to  penetrate  further  along  the  path  here  entered,  as  A.  Hurwitz  has 
already  done  in  two  interesting  papers,1  “Ueber  arithmetische  Eigenschaften  gewisser  tran- 
szendenter  Funktionen.”  I  should  like,  therefore,  to  sketch  a  class  of  problems  which,  in 
my  opinion,  should  be  attacked  as  here  next  in  order.  That  certain  special  transcenden¬ 
tal  functions,  important  in  analysis,  take  algebraic  values  for  certain  algebraic  arguments, 
seems  to  us  particularly  remarkable  and  worthy  of  thorough  investigation.  Indeed,  we  ex¬ 
pect  transcendental  functions  to  assume,  in  general,  transcendental  values  for  even  algebraic 
arguments;  and,  although  it  is  well  known  that  there  exist  integral  transcendental  functions 
which  even  have  rational  values  for  all  algebraic  arguments,  we  shall  still  consider  it  highly 
probable  that  the  exponential  function  e'nz,  for  example,  which  evidently  has  algebraic  val¬ 
ues  for  all  rational  arguments  z,  will  on  the  other  hand  always  take  transcendental  values  for 
irrational  algebraic  values  of  the  argument  z.  We  can  also  give  this  statement  a  geometrical 
form,  as  follows: 

If,  in  an  isosceles  triangle,  the  ratio  of  the  base  angle  to  the  angle  at  the  vertex  be  algebraic 
but  not  rational,  the  ratio  between  base  and  side  is  always  transcendental. 

In  spite  of  the  simplicity  of  this  statement  and  of  its  similarity  to  the  problems  solved  by 
Hermite  and  Lindemann,  I  consider  the  proof  of  this  theorem  very  difficult;  as  also  the  proof 
that 

The  expression  ,  for  an  algebraic  base  a  and  an  irrational  algebraic  exponent  j8,  e.  g., 
the  number  2 ^  or  en  =  i  2l,  always  represents  a  transcendental  or  at  least  an  irrational 
number. 

It  is  certain  that  the  solution  of  these  and  similar  problems  must  lead  us  to  entirely  new 
methods  and  to  a  new  insight  into  the  nature  of  special  irrational  and  transcendental  num¬ 
bers.  [51,  pp.  455-456]. 

Hilbert  mentions  Charles  Hermite,  who  proved  in  1873  that  e  is  transcendental, 
and  Ferdinand  Lindemann,  who  proved  in  1882  that  K  is  transcendental  [26,  p.  466]. 
The  answer  to  Hilbert’s  question  was  published  in  1934  by  Aleksandr  O.  Gelfond, 
and  (independently)  by  Theodor  Schneider  in  1935.  It  follows  from  the  Gelfond- 

Schneider  theorem  that  s/2  ~  is  irrational  (see  Project  29.4),  but  there’s  an  easier 
example.  You  can  find  this  easier  solution  at  [104]. 

Hilbert’s  original  address  can  be  found  in  [50].  The  full  text  of  the  English  trans¬ 
lation  is  available  on  the  Web,  [51].  See  also  [59,  Chapter  25,  sec.  1]  and  [59,  p. 
980].  For  another  view  of  Hilbert’s  problems  read  [36],  and  for  a  recent  book  on 
this  topic  see  [37], 

In  honor  of  the  100-year  anniversary  of  Hilbert’s  Paris  address,  the  new  century, 
and  the  new  millenium,  several  mathematicians  were  asked  to  pose  problems  for 
the  next  century.  Steve  Smale  proposed  18  problems  for  your  century  that  you  can 

1  Math.  Ann.,  vols.  22,  32  (1883,  1888). 
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find  in  [100].  The  article  [39]  by  Phillip  Griffiths  also  contains  a  look  at  challenges 
for  the  future.  The  Clay  Mathematics  Institute  of  Cambridge,  Massachusetts  (CMI) 
selected  seven  problems  for  the  new  millenium.  They  also  offer  a  reward  of  one 
million  dollars  per  problem,  and  consequently  have  received  a  fair  amount  of  pub¬ 
licity.  More  information  about  the  Institute  and  the  problems  can  be  found  on  their 
website,  [19],  as  well  as  in  [21]. 


Appendix 


Algebraic  Properties  of  M 

We  will  assume  that  you  are  familiar  with  the  following  properties  of  R. 

If  x  and  y  are  real  numbers,  then  both  x+y  and  x ■  y  are  real  numbers.  Furthermore, 
addition  and  multiplication  satisfy  the  following  axioms: 

Al.  (The  commutative  property  for  addition)  x  +  y  =  y  +x  for  all  real  numbers  x 
and  y; 

A2.  (The  associative  property  for  addition)  (x  +  y)  +z  =  x+  (y  +  z)  for  all  real 
numbers  x,y,  and  z; 

A3.  (Existence  of  additive  identity)  There  is  a  unique  real  number  0  such  that 
0 + x  =  x  for  all  x  €  R; 

A4.  (Existence  of  additive  inverse)  If  x  €  R,  then  there  is  a  unique  element  — x 
such  that  x+  (— x)  =  0; 

M 1 .  (The  commutative  property  for  multiplication)  x  •  y  =  y  •  x  for  all  real  numbers 
x  and  y; 

M2.  (The  associative  property  for  multiplication)  (x  •  y)  •  z  =  x  •  (y  •  z)  for  all  real 
numbers  x,y,  and  z,\ 

M3.  (Existence  of  multiplicative  identity)  There  is  a  unique  real  number  1,  with 
1  ^  0,  such  that  1  •  x  =  x  for  all  real  numbers  x. 

M4.  (Existence  of  multiplicative  inverse)  For  each  nonzero  real  number  x,  there 
exists  a  unique  real  number  x~ 1  such  that  x  ■  x  1  =  1 ; 

D 1 .  (The  distributive  property)  (x+y)-z  =  x-  z+y-z  for  all  real  numbers  x, y,  and 
z. 

We  note  that  this  list  of  properties  is  not  minimal;  for  example,  the  uniqueness  of 
the  additive  identity  0  follows  from  some  of  the  other  properties  in  the  list. 
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Order  Properties  of  M. 

A  set  satisfying  all  of  the  properties  above  is  called  a  field.  Thus,  R  is  an  example 
of  a  field.  In  addition,  M  has  an  order  defined  on  it.  This  means  the  following: 

There  is  a  subset  R+  of  M\  {0}  satisfying: 

Ol.  If  x,y  £  R+,  then x-y  £  R+; 

02.  If  x,y  £  R+,  then  x  +  y  £  R+; 

03.  For  every  real  number  x,  exactly  one  of  the  following  three  things  happens: 
either  x  £  R+,  —  x  £  R+,  or  x  =  0. 

If  x  and  y  are  two  real  numbers  and  x  —  y£  R+,  we  write  x  >  y  for  v  <  x).  The  set 
R+  is  called  the  positive  real  numbers.  Thus  R  is  a  field  with  an  order,  and  we 
call  it  an  ordered  field.  The  third  property,  03,  is  called  the  trichotomy  principle. 
It  is  not  difficult  to  show  that  the  results  below  follow  from  the  statements  A1-A4, 
M1-M4,  Dl,  and  01-03. 

Theorem.  Let  x.y,  and  z  be  real  numbers.  Then  the  following  hold: 

1.  If  x  <  y  and  y  <  z,  then  x  <  z; 

2.  Ifx<  y,  then  x  +  z<y  +  z; 

3.  If  x  <  v  and  z  >  0,  then  x  ■  z  <  y  •  z; 

4.  If  x  <  y  and  z  <  0,  then  x-z>  y-z; 

5.  If  x  f  0,  then  x2  >  0; 

6.  1  >  0; 

7.  If  x  >  0,  then  x~ 1  >0. 

Proof  Well  do  the  first  and  the  sixth  of  these;  you  can  prove  the  others. 

For  the  proof  of  (1),  note  that  y  —  x£  R+  and  z  —  y£  R+.  By  02  and  the  as¬ 
sociative  and  commutative  properties  of  addition,  (y  —  x)  +  (z  —  y)  =  z  —  x  £  R+. 
Therefore  z  —  x£  R+  and  x  <  z. 

For  the  proof  of  (6),  note  that  1  is  the  multiplicative  identity,  so  1  x  =  x  for  all 
x  £  R.  Taking  x  =  1,  we  get  l2  =  1  •  1  =  1.  Since  1  /  0,  the  result  now  follows  from 
(5).  □ 


Axioms  of  Set  Theory 

To  give  set  theory  and  large  parts  of  mathematics  a  firm  foundation,  axioms  were 
developed  upon  which  mathematicians  could  agree.  The  rest  of  set  theory,  then, 
needs  to  follow  from  these  axioms  using  the  rules  of  logic.  Currently  the  generally 
accepted  axiomatic  system  is  that  due  to  Ernst  Zermelo  and  Abraham  Fraenkel, 
together  with  the  axiom  of  choice.  The  abbreviation  ZFC  is  commonly  used  for  this 
system.  (This  list  of  axioms  follows  that  of  [41],  except  for  the  axiom  of  choice 
where  we  preferred  a  different  version.) 
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ZFC  1  (Axiom  of  extension)  Two  sets  are  equal  if  and  only  if  they  have  the  same 
elements. 

ZFC  2  (Axiom  of  specification)  For  every  set  A  and  every  condition  S(x),  there 
corresponds  a  set  B  whose  elements  are  exactly  those  elements  x  of  A  for 
which  S(x)  holds. 

ZFC  3  (Axiom  of  pairing)  For  every  two  sets  there  exists  a  set  to  which  they  both 
belong. 

ZFC  4  (Axiom  of  unions)  For  every  collection  of  sets  there  exists  a  set  that  contains 
all  the  elements  that  belong  to  at  least  one  set  of  the  given  collection. 

ZFC  5  (Axiom  of  powers)  For  each  set  there  exists  a  collection  of  sets  that  contains, 
among  its  elements,  all  the  subsets  of  the  given  set. 

ZFC  6  (Axiom  of  infinity)  There  exists  a  set  containing  0  and  containing  the  suc¬ 
cessor  of  each  of  its  elements. 

(Recall  that  0  =  0  and  the  successor  of  x  is  x+  =rU{r}.) 

ZFC  7  (Axiom  of  substitution)  If  A  is  a  set  and  S(a,b)  is  a  sentence  such  that  for 
each  a  in  A  the  set  {b  :  S(a,b)}  can  be  formed,  then  there  exists  a  function 
F  :  A  — >  {{b  :  S(a,b)}  :  a  £  A}  such  that  F(a)  =  {b  :  S(a,b)}. 

ZFC  8  (Axiom  of  choice)  Given  a  nonempty  collection  &  of  nonempty  sets,  there 
is  a  function  /  :  &  — >  |JA6^r  A  such  that  /(A)  £  A. 
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Polya’s  List 


0  From  the  inside  cover  of  George  Polya,  How  to  Solve  it  [84],  Copyright  ©1945,  1973  renewed 
by  Princeton  University  Press.  Reprinted  by  pennission  of  Princeton  University  Press. 
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